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Add Bluetooth to your 
MicroBlaze-based designs. 


“Embuiided j 
*Bevebapene 





(It doesn’t get any easier than this.) 


Now there's a faster, easier way to build and test a Bluetooth® system using 
embedded processors. It's the Embedded Bluetooth Development Kit from 
Memec Design, and it provides everything you need to add a Bluetooth inter- 
face to existing Xilinx® MicroBlaze™ designs. Developed with industry-leading 
partners, this unique kit can help you save weeks of development time in your 
challenging next-generation applications. To find out more, contact us at 


www. insight.na.memec.com/xilinx_bluetooth. 
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Xilinx Wins Again 


ilinx has built its reputation on the triumvirate of silicon, software, and support. 
And although silicon and software get the most news coverage, it’s the support services 
team that holds it all together. 


The support.xilinx.com website — and the personalized mysupport.xilinx.com portals — are accessed 
by more than 110,000 users each month, and are credited for saving customers substantial time 
to knowledge, which translates to faster time to market. 
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In fact, the MySupport.xilinx.com support service is so good that the é. Award 

Association of Support Professionals has named it to its list of “The Year's A pee g 

Ten Best Web Support Sites.” Xilinx joins fellow industry leaders Dell, % ~ 


Hewlett Packard, Mentor Graphics, and Novell on the list of companies The Year's Ten Best 
awarded for excellence in online service and support. Web Support Sites 


To select this year’s winners, judges evaluated companies on such criteria as usability, design, navi- 
gation, and knowledge base, as well as search implementation, interactive features, personalization, 
and major site development. 


According to Jeffrey Tarter, executive director of the Association of Support Professionals, “Users 
are looking for highly interactive technical support services they can personalize to fit their needs 
and can access in real time. Xilinx is a leader in delivering the kind of online support customers 
are looking for.” 


Co-winner Mentor Graphics also praised MySupport.xilinx.com: “Many of our customers have told 
us that the quality and coverage of support.xilinx.com’ technical content allows them to readily 
understand complex FPGA architectures.” 


As we said, Xilinx support doesn’t get as much publicity as Xilinx silicon and software, but it is 
just as important. 


Don Mullikin, vice president of Xilinx Global Services, puts it this way: “Design teams tackling 
today’s most complex system-design challenges need support that matches the caliber of the technical 
capabilities of our products.” 


Our award-winning support matches our award-winning hardware and software, so you get a 
complete design solution that is the best in the industry. 


Tom Durkin 
Managing Editor 


Correction 
In the summer 2003 edition of Xcell Journal, the URL for more information on RocketPHY™ transceivers 


was incorrectly reported in the article “New RocketPHY Transceiver Family Debuts at 10 Gbps.” The correct 
URL is www.xilinx.com/rocketphy/. We regret this error and any inconvenience it may have caused. 
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Performance + Time = Memory 


By approaching FPGA designs as 
three-dimensional endeavors, you can 
radically reduce device size — and cost. 
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fifth-generation vehicle control system 





YAe5)| olulsare) 


View {fons el 0) eeee........aaeneeanenneinee nt al 5 
ISE 6.1i Continues to Lower Design Costs ..........cccsssssesssseessseees ) 


Put Hardware in the Loop with System Generator for DSP........... 8 


Get Physical with the Palace Synthesis Solution .................00. 1] 
Seamless Co-Verification Accelerates Time to Market .............0 14 
In FPGA Synthesis, Timing Is Everything ............sssscssccssssseees 18 
Advanced Switching Expands PCI Express.............csssssssesseseen 20 
Get the Platform Flash PROMIse ...............ssssssssssssecssseeeeeeeeee 1B 
Performance + Time = Memory ........sssssessssscssssseeeecsesssssseeees 24 
How to Implement a DDR SDRAM Controller ..............0e.s-ss0e. 30 
ISAOUMERGADESIgNESECUIG!, ser eaeeemnnneenn een nnernes 32 
Virtex FPGAs Tame DSP-Hungry Designs.............scscsssseccceseees 36 
Biology Goes Digital — The BioWall Story ...........sescssccssssseees 40 
Formula 1 Racing: The Xilinx Advantage ...........ssssccscccssssseeee 46 
CoolRunner-ll Has Hidden Talents ............sssssssssscccccccecesseseen 50 
Boeing Uses Virtex-lI Pro for Image Processing ................sssse+. 53 
Explore the CoolRunner-Il Design Kit.........ssesccssssesscssseeeceesseee 58 
Develop Applications Right Out of the BOX..........c:...ssssssseeee. 61 
Memec Design Simplifies Bluetooth Development .................. 64 
Save Time, Money, and Space .......cessssssssesscssseesssesssesseesseees 68 
Toke the Titanium Solution .........ssssssssesssscsccccccsessessnsenenes 7] 
Maximize Your Success with Spartan-3 Designs ...........:..-..0++ 72 
It’s a Programmable World .............ssssesscsssesessessseeecssssseeeeeees 74 
Xilinx Partner Yellow Pages.............sessssssssesccsssseecsessseeeessseees 77 
Reference Pages.......:.0 84 


To subscribe to the printed Xcell Journal, 
or to view the Web-based Xcell Online Journal, visit: 
www.xilinx.com /publications/xcellonline/ 








View 


ELom Ene cep) 


Innovation, Education, Exploration 


From Programmable World in 2003 to landing on Mars in 2004, 
Xilinx leads the industry in new products and new applications. 


Xilinx 


and services con- 


products 


tinue to expand 
the limits of possi- 
bility. We strive to 
make the best and 


most innovative 





products in the 
programmable 


by Wim Roelandts 
CEO, Xilinx, Inc. 


as a complete solution that gives you every- 


logic industry — 
and deliver them 


thing you need to bring your product to 
market first. Here are a few of our innova- 
tions that may interest you. 


Programmable World 

On May 6, Xilinx launched Programmable 
World 2003 — a nine-month international 
initiative designed to help you understand 
and use the latest programmable logic solu- 
tions. It includes technical forums, exhibits, 
and workshops in dozens of cities around the 
world. By the time it ends in November, 
Programmable World 2003 will have 
reached more than 10,000 executives, system 
architects, engineers, and project managers 
worldwide. Co-sponsored by more than 30 
industry leaders — including Agilent, 
Cadence, IBM, Intel, Texas Instruments, 
Mentor Graphics, and Wind River Systems — 
Programmable World 2003 started in North 
America, then went to Europe, on to China 
and Taiwan, and will finish in Japan and 
Korea in November. 

Following the worldwide forums, which 
ended in August, we have been holding in- 
depth technical workshops, including multi- 
ple full-day courses on high-speed DSP, 
connectivity, processing, and system design. 

At the Programmable World 2003 forum 
in Japan this summer, Dr. Tsugio Makimoto, 
chief technical officer of Sony, asserted that 
field programmability is “hitting its peak” 
from 1997 through 2007. Dr. Makimoto 
stated a second wave of digital consumer 
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products will become the mainstay for 
future market expansion, just as personal 
computing products drove the first wave of 
the digital revolution. “Field programmabil- 
ity is a must in this new era of digital con- 
sumer goods,” Dr. Makimoto said. “These 
products have short time-to-market win- 
dows, and product life cycles that come 
quickly and end dramatically. Flexible, agile 
solutions are critical.” 





Xilinx Goes to Mars 
NASA 
Exploration Rover (MER) missions, which 


recently launched two Mars 


depend on the most advanced, radiation- 
tolerant Virtex™ FPGAs in critical applica- 
tions for both the landers and rover vehicles. 
The Spirit MER was launched June 10 and 
the Opportunity MER was launched July 7. 

Chosen because of their reprogramma- 
bility and high densities, the Virtex FPGAs 
serve as the main brain of the motor control 
boards. These FPGAs will be used to control 
the pyrotechnic devices that will cushion the 
touchdown of the landers in January 2004. 
The Xilinx FPGAs will also direct the motor 
control functions on the rovers, including 
controllers for the wheels, steering, and 
antenna gimbals. 


The radiation-tolerant Virtex devices on 
the two MER missions have up to one million 
system gates and total ionizing dose guaran- 
teed to 100 kRads(si). We will soon announce 
a next-generation family of radiation-tolerant 
Virtex-II products with up to six million sys- 
tem gates and total ionizing dose guaranteed 


to 200 kRads(si). 


RocketPHY 

In May, we introduced our new RocketPHY™ 
stand-alone, SONET-compliant, 10 Gbps 
transceivers. These devices are among the 
industry's first 10 Gbps products using CMOS 
process technology. 

The RocketPHY transceivers are the first 
products from Xilinx that are not directly 
related to programmable logic devices, but 
they are designed to work with our FPGAs. 
These transceivers use the same high-speed 
I/O technology as our Virtex-H] Pro™ X 
FPGAs in application-specific designs that 
solve many I/O challenges — at far less cost 
than any competing product in networking 
applications. 


Conclusion 

Xilinx continues to innovate in technology 
and penetrate new markets. Although our 
process technology, devices, and software 
are continually setting the highest standards 
for performance, quality, and reliability, we 
are also setting new standards in other criti- 
cal areas that affect your life. You'll find that 
our products, our support services, and our 
educational programs work together to 
bring you a cohesive, faster time to market, 
and lowest cost path to your next product 
introduction. There simply is no easier, 
faster, or lower cost way to develop flexible 
new products than with Xilinx silicon, soft- 
ware, and support. & 
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by Lee Hansen 


ISE 6.1i Continues fo fam 

- y leweuesk ees ionuer Miliax Inteornued 
Lower Design Costs | 
The latest release of Xilinx programmable i neneenn nenreernee 


tinues the Xilinx commitment to lower 


design sonware SONS YOU time == and’ money. your design costs, to deliver ease of use 


that focuses on solving your specific 


Software Environment programmable 
design tools — ISE 6.1i — is now avail- 
able to registered in-maintenance Xilinx 


customers. 


engineering problems, and to expand 
support for low-cost Spartan™-3 
FPGAs — and native support for the Red 


Hat™ Linux™ operating system. 


Some of the new capabilities in ISE 
6.11 include: 


* Support for the largest Spartan-3 
XC3S4000 and XC3S5000 devices 


¢ Increased Spartan-3 device support 


in ISE WebPACK™ and ISE BaseX 
¢ New ISE native Linux support 
e Automatic Web software update 


¢ PACE (Pinout and Area Constraints 
Editor) enhancements for CPLD, 


git : 


nn Sete poe aie ! ee ee ' te and PC board design assistance 


¢ Project Navigator mixed-language 
and embedded design enhancements 


¢ As much as 15% better performance 


than ISE 5.1i. 


ISE Shrinks Costs 

Spartan-3 Platform FPGAs are the 
world’s lowest cost FPGA device family. 
ISE 6.1i software will help you lower the 
cost of using Spartan-3 FPGAs even fur- 
ther with these new features. 


More Spartan-3 Devices in ISE 

ISE WebPACK 6.1i, which is free to 
designers, has added support for the 
Spartan-3_ XC3S200 and XC3S400 
devices, at 200,000 and 400,000 system 
gates, respectively. ISE BaseX now sup- 
ports the XC3S400 device as well. 





6 Xcell Journal Fall 2003 





High Density Spartan-3 Support 

ISE Foundation™ and ISE Alliance 
Series™ 6.1i the 
highest-density Spartan-3 devices, the 
XC3S4000 and XC3S5000 FPGAs at 4 


million and 5 million gates, respectively. 


versions support 


Now you have even greater flexibility in 
choosing Spartan-3 devices and 
ISE configurations. 


ISE Supports Native Linux 

ISE 6.1i is also the first ISE release 
that runs on native Red Hat 
Linux versions 7.3 and 8.0. The 
installation CDs for 32-bit Linux 
that come with your ISE 6.1i 
shipment will help you make the 
best use of your corporate pro- 
grammable design platforms. 


Automatic Web Update 

Another new capability is auto- 
matic Web software update. Upon 
execution, ISE will notify you 
whenever a new service pack 
upgrade is available. And if selected, ISE will 
only download those parts of the service pack 
that apply to your unique installation. This 
feature saves you the time of identifying and 
updating your current software configuration 
—and minimizing the required disk space. 


Ease of Use Engineers Demand 

Many software design companies settle for 
a good look-and-feel graphical user inter- 
face as their standard for “ease of use.” 

ISE 6.1i goes beyond being just another 
pretty GUI. It focuses on solving engineer- 
ing bottlenecks and design headaches that 
hinder your design process and progress. 


PACE Enhancements 

Figure 1 shows an example of PACE 
(Pinout and Area Constraints Editor), 
introduced with ISE 5. PACE delivers pin 
definition and area management in an easy- 
to-use, graphically oriented environment. 
You can speed your design flow faster and 
easier with PACE. 

PACE now offers CSV (comma separat- 
ed value) file import and export. This capa- 
bility offers you new flexibility in PC board 
design, including the ability to create pin 
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tables in Microsoft™ Excel™ spreadsheets 
and import those into PACE. If the pin 
tables are modified, they can be exported 
back to the Excel workbook using the CSV 
format. This export/import capability eases 
the job of integrating the logic device into 
the board layout. 
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Figure 1 — Pinout and Area Constraints Editor (PACE) 


PACE can also import and export 
VHDL and Verilog™ HDL files, which 
allow PACE to define I/O from the HDL 
port definitions — or write top-level HDL 
starting templates. 

New design check capabilities help you 
predict output problems. PACE contains a 
new package flight-time display that 
graphically shows pin-delay time that is 
based on pin-to-pad estimates. PACE also 
checks for simultaneous switched outputs 
to prevent common high-drive strength 
outputs that could potentially create 
ground bounce signal problems. 

The new version of PACE also supports 
an enhanced auto-floorplanning capability, 
which lets you identify area groups using 
PACE. Once a logic area group is identified, 
the ISE place-and-route tools create the 
floorplan, which saves you more design time. 


Expanded Project Navigator 

Project Navigator, the ISE design and 
project manager, has also been enhanced 
in version 6.1i. Project Navigator now 
supports mixed-language Verilog and 
VHDL design for customers using 
Synplicity® Corp.’s Synplify® tool suite or 
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XST (Xilinx Synthesis Technology) soft- 
ware for their synthesis solutions. This 
new flexibility allows managers to mix the 
best possible design source code for any 
particular project. This, in turn, allows 
you to more easily and quickly mix and 
match your purchased IP with your own 
in-house design expertise regard- 
less of design language. 

Project Navigator also now 
links to the Xilinx EDK 
(Embedded Design Kit) XPS 
project manager supporting 
MicroBlaze™ and Virtex-II 
Pro™ embedded 
designs. This new integration 


processor 


shows an embedded project enti- 
ty along with the design logic, 
and launches XPS when double- 
clicked, offering the first in a new 


series of upcoming enhancements 





that will bring Xilinx logic and 
embedded programmable design 
tools closer together. 


Still the Fastest 

ISE 6.1i continues to deliver the fastest pro- 
grammable device performance available. 
Enhancements to our lightning-quick 
ProActive Timing Closure implementation 
technology now deliver up to 15% better 
performance over ISE 5.1i software. 

The new INPUT_JITTER timing con- 
straint lets an engineer describe system jit- 
ter and clock edge uncertainty. With more 
timing constraints, high-speed design rules, 
and local clocking options than any other 
programmable vendor, ISE 6.1i gives you 
the ability to design high-speed memory 
interface timing and double-data-rate local 
clock designs accurately. 


Conclusion 

ISE continues to define the standard of 
logic design. By concentrating on cost, 
productivity, and ease of use, ISE is deliver- 
ing the tools necessary for programmable 
systems design that helps you squeeze the 
most out of your logic device. 

To find out more about ISE 6.1i, go to 
www.xilinx.com/xcell_ise/, To order your 
copy of ISE 6.1i, contact your local sales 
support representative. &£ 
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Put Hardware in the Loop with 
Xilinx System Generator tor DSP 


Co-simulating with hardware in the loop gives you faster simulations and eases hardware verification. 
System Generator for DSP Version 3.1 now lets you include FPGA hardware in Simulink simulations. 


by Nabeel Shirazi 

Senior Staff Software Engineer 
Xilinx, Inc. 
nabeel.shirazi@xilinx.com 


Jonathan Ballagh 

Senior Software Engineer 
Xilinx, Inc. 

jonathan. ballagh@xilinx.com 


FPGA hardware can now be included in 
simulations controlled by the MATLAB™ 
language and its Simulink™ design tools. 
At the push of a button, the Xilinx System 
Generator for DSP Version 3.1 software 
tool produces an implementation of your 
model that is ready to run in hardware. 
System Generator uses a special co- 
simulation block to control the design 
hardware during Simulink simulations. 
The co-simulation block looks just like 
traditional System Generator blocks. Its 
ports have names and types that match the 
ports on the original System Generator 
subsystem. The co-simulation flow for this 
architecture is shown in Figure 1. The sim- 
ulation behavior of the co-simulation block 
is bit- and cycle-accurate when compared to 
the behavior of the original subsystem. 
System Generator’s ability to incorporate 
hardware in the loop enables significant sim- 
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ulation speedups, provides incremental 
hardware verification capabilities, and 
removes many of the hurdles to getting a 
design up and running in an FPGA, 


Hardware Co-Simulation Flow 

Together, System Generator and hardware- 

in-the-loop co-simulation solve many 

problems normally associated with simu- 

lating and verifying a Xilinx FPGA design. 

¢ You don’t have to know a hardware 

description language. System 
Generator performs automatic code 
generation when your model is trans- 
lated into hardware. 


HDL 
Co-Simutation 


Simulute 
Legacy HDL 








¢ You don’t need in-depth knowledge of 
the board that hosts the FPGA. System 
Generator makes sure the correct hard- 
ware platform is targeted and imple- 
ments the design accordingly. 


¢ You need not create a separate test bench 
application. The co-simulation block 
can be used with the same Simulink test 
bench apparatuses that were used to test 
the original System Generator model. 


The starting point is the System 


Generator model — or subsystem — that 
you want to co-simulate. You identify the 
subsystem by dragging and dropping a 


Hordwore 
in the Loop 


y Horéware 


Verification & 
Implementetion Acceleration 


Figure 1 - MATLAB/Simulink and hardware co-simulation 
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compilation block from the XtremeDSP™ 
development kit library, for example, into 
the design, as shown in Figure 2. 

A subsystem that contains a compilation 
block should also contain a System 


block. 


Generator uses the compilation 


Generator System 
block to provide information 
about the underlying hardware 


= | 
eS = 
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Figure 2 — System Generator block and an XtremeDSP compilation block 





Figure 3 — System Generator dialog box 
with inactive product family, device, 
speed, and package fields 


and how to prepare the model for co-simu- 
lation. Pressing the “generate” button on a 
System Generator block that has an accom- 
panying compilation block automatically 
invokes the hardware co-simulation flow. 
Some of the information obtained 
from a compilation block overrides values 
on the System Generator block parame- 
ters dialog box. The parameters that are 
overridden appear as grayed-out fields in 
the System Generator dialog box and can- 
not be modified. For example, the System 
Generator dialog box is shown for a sub- 
system that also contains an XtremeDSP 
compilation block. Notice in Figure 3 
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that the product, device, speed, and pack- 
age fields have been grayed-out and are 
inactive. These parameters are used by 
System Generator when you press the 
“generate” button and the model is trans- 
lated into hard- 
ware. This 
ensures that the 
correct device 
information is 
preserved when 
the FPGA config- 
uration bit file is 
generated. 

Generating a 
model for hard- 
ware co-simulation 
includes operations 
such as producing 
HDL code and 
netlists and invok- 
ing the Xilinx 
CORE Gen- 
erator™ tool. After System Generator has 
generated a model, it invokes a compilation 
block script. Using the output from System 
Generator, the compilation block script 
invokes the tools necessary to produce a 
configuration file for the FPGA. Figure 4 
illustrates the XtremeDSP compilation 
block script running ngdbuild. 

The script creates a new library and adds 
a co-simulation block that is parameterized 
with information from the original subsys- 
tem. This information includes subsystem 


interface information such as port names, 
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port data rates, port data types, and port 
directions. Figure 5 shows the library and 
co-simulation block created by the compila- 
tion block script for the cosim_ex model. 

Figure 6 shows the co-simulation 
block first illustrated in Figure 5 as it is 
brought back into the cosim_ex model for 
simulation. Note that the ports on the co- 
simulation block match the port names 
on the input and output gateway blocks. 
The waveform in the scope shows equiva- 
lent output results for the AddSub block 
and the co-simulation block. The block’s 
inputs must be the same signal type, as 
shown in Figure 7. 


Using Co-Simulation Blocks in Simulink 

A co-simulation block is a Simulink block 
that interacts with an underlying co- 
simulation platform. This block behaves 
as a normal Simulink block and can be 
simulated transparently with other 
blocks. Each co-simulation block has a 
port interface that is parameterized by the 
compilation block script to match the 
ports on the original subsystem. 

System Generator co-simulation blocks 
can be driven by either Xilinx fixed-point 
data types or by double-signal data types. 
The hardware co-simulation block must 
be of the same type of signals (bit width, 
binary point position, signed/unsigned) 
that were used in the original model. If 
there are no inputs to the block, the block 
allows you to choose between Xilinx fixed- 


point or double-signal output types. 





Figure 4 — Compilation block script for the XtremeDSP development kit 
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Figure 5 — Co-simulation library — 
cosim_ex_hwcosim_lib — 
created by the post-generation script. 


When System Generator implements a 
model in hardware, it uses a single clock 
source to drive the design. Multi-rate 
blocks are supported by using derived clock 
enables. These clock enables have periods 
that are integer multiples of the period of 
the system clock. Data traveling in and out 
of the hardware also has a period that is a 
multiple of the system clock period. These 
multiples, or ratios, between the port data 
rates and the system clock rate are stored in 
the co-simulation block by the compilation 
block script when it is instantiated. 

A co-simulation block must be driven 
by ports with data rates equivalent to the 
gateway data rates of the original subsys- 
tem. The block automatically adjusts its 
output ports to run at the appropriate 
rates. If the block detects an incorrect input 
port rate, it will generate an error message. 

The easiest way to ensure that an input 
port has the appropriate rate is to drive the 
co-simulation port with the same signal that 
drove the original subsystem port or gateway. 


Hardware Clocking 
There are many possible synchronization 
techniques for the interface between FPGA 
hardware and Simulink. One technique uses 
a single-step clock to keep the hardware in 
lockstep with the software simulation. This 
is achieved by providing a single clock pulse 
to the hardware for each simulation cycle. 
Using this technique enables you to perform 
incremental design and verification. 

On the other hand, the single-step clock 
technique has its disadvantages. When it is 
used for clocking the FPGA design, the 
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communication overhead between hardware 
and Simulink can severely limit the effective 
processing rate in some designs. This condi- 
tion is exacerbated by bus latency. 
Simulation speed can be greatly increased 
by allowing the hardware to process more 
than one set of input samples at a time. One 
way to accomplish this is to provide a free- 
running clock to the design under test. Use 
an explicit synchronization mechanism such 
as a flag implemented as a memory-mapped 
register to coordinate data transfers between 
Simulink and the hardware. The inputs and 











ah , AP AGG & 





Figure 6 — Co-simulation block parameterized 
with port names that match the original model. 
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A co-simulation block 
can be driven by either 
Xilinx fixed-point or 
double signal types. 


Application 





Software Simulation Time 


outputs of the design are written to and sam- 
pled asynchronously. 


Benchmarks 

When using a single-step clock, speedups of 
seven to 50 times are typically achieved. The 
demos that come with System Generator 
were used as benchmarks. The results are 
shown in Table 1. 

Referring to the table, the bit error rate 
(BER) tester was created to test forward 
A free- 
running clock source achieved a speedup of 


error correction (FEC) blocks. 


six orders of magnitude. The data source for 
the BER tester was an LFSR in the FPGA. 

The test is started, and 
szis!| after some time when a 
“done” flag is set, you can 
read the results from the 
FPGA and display them 


in Simulink. 




















Conclusion 

System Generator hard- 
ware co-simulation provides you with a sim- 
ple but powerful method of incorporating 
hardware co-simulation in Simulink and 
System Generator models. The hardware 
co-simulation flow automates the entire 
hardware implementation process. You have 
more time to focus on the design itself. 

Once the hardware co-simulation software 
has produced a co-simu- 
lation block, it can be 
used just like any other 
System Generator block 
in a model. 

To learn more about 
System Generator for 
DSB, go to www.xilinx. 
com/system_generator/ 
and click on “System 
Generator for DSP” & 


Hardware Execution Time Speedup 


Additive White Gaussian 600 sec 80 sec 7.5x 
Noise Channel (AWGN) 


Table 1 — Benchmark results for hardware-in-the-loop co-simulation 
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Get Physical with the PALACE 


Synthesis Solution 


Aplus Design Technologies’ PALACE physical synthesis tool can drive 
OOTMVOUMCOSM TAN ININOMTOOMUUNMINeamOmey ese MVONMISSI 1 
requirements with a slower speed grade or a smaller device. 
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by Sheldon D’Paiva 


Applications Engineer 
Aplus Design Technologies 
sdpaiva@aplus-dt.com 





Xilinx Spartan™-3 FPGAs give designers 
advanced features at previously unattain- 
able costs. Moreover, Spartan-3 densities 
reaching five million system gates provide 
cost-competitive solutions in markets once 
only addressable by ASICs. But as FPGAs 
penetrate the ASIC space, FPGA designers 
must deal with the same complexities faced 
by ASIC designers. 

In particular, ASIC designers know 
that to meet design requirements and 
reduce iterations within the design flow, 
synthesis must be coupled with physical 
design considerations. 

Synthesis design flows that do not take 
physical design into account can erode your 
time to market and cost advantages. With 
the large densities and complexities of mod- 
ern low-cost FPGA architectures such as 
Spartan-3 FPGAs, you cannot afford to 
ignore interconnect delay and architecture- 
specific considerations during synthesis. 

Poorly synthesized designs cause greatly 
increased runtimes for the place-and-route 
tools, as they struggle to meet timing con- 
straints. Bad designs also create an addi- 
tional burden on the designer by requiring 
expert manual intervention and time- 
consuming iterations between synthesis 
and implementation tools. 

Furthermore, additional costs are 
incurred if the design requirements are 
not met in the targeted device. For high- 
volume and low-density devices, moving 
to a faster speed grade typically translates 
to a 13% increase in cost; moving to a 
larger part translates to a 16% increase in 
cost. This increase in cost is even greater 
for lower volume designs or when using 
higher density devices. 
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Physical synthesis couples synthesis with 
physical design. It has become a key enabling 
technology that can help you maximize the 
potential of FPGAs, especially low-cost 
FPGAs such as Spartan-3 devices. 

Aplus Design Technologies, a leader in 
physical synthesis technology, has recently 
added Spartan-3 support to its fully auto- 
mated physical synthesis product called 
PALACE™ (Physical And 
Automatic Compilation Engine). The 


Logic 


PALACE physical synthesis tool combines 
logic optimization, architecture-specific 
mapping, and placement-driven synthesis 
with detailed device modeling. 

Using the PALACE tool, you can meet 
your design requirements without manual 
intervention, floorplanning, or numerous 
lengthy iterations between synthesis and 
place-and-route. 

In fact, by fully exploiting the capabili- 
ties of the FPGA architecture, you can often 
meet your design requirements in a slower 
speed grade or a smaller part and signifi- 
cantly reduce your costs. 


PALACE Physical Synthesis 

PALACE physical synthesis employs a suite 
of architecture-specific optimization tech- 
niques in both the logic and physical 
domains. These techniques include logic 
restructuring, flip-flop repositioning, con- 
straint-driven mapping with delay-area 
tradeofts, logic duplication, placement, and 
placement-driven optimization. 


PALACE technology highlights include: 
¢ Unified timing models 


— Detailed device + interconnect 
modeling 


— Consistent model throughout 
physical synthesis 


¢ Physical planning 
— Global interconnect planning 


— Timing-driven routability-aware 
g 
placement 


* Optimization in both logic and 
physical domains 


— Timing and area optimization 
— Multi-clock constraint-driven 


retiming 
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— Logic restructuring and replication 
¢ Technology mapping 
— Architecture-specific mapping 


— Constraint-driven with delay-area 


trade-off. 
The benefits of the PALACE solution 


include: 


¢ A fully automated solution with no 
expert user intervention required 


¢ Faster timing closure by reducing itera- 
tions within the design flow 


¢ Performance improvement by at least 
one speed grade 


¢ Reduction in area utilization 
¢ Reduction in place-and-route runtime. 
PALACE Design Flow 


There are two design flows that can be used 
with PALACE technology: 


1. Standard flow: This is the initial flow 
that should be used for your design. 


Mapped 
Netlist 


P 





2. Guided flow: If necessary, this flow 
should be used to achieve incremental 
improvements for a fully placed 


design (for Virtex™-II FPGAs). 


Standard Flow 

When you use the PALACE tool in the 
standard flow, you can use any synthesis 
tool that generates an EDIF netlist. This 
dramatically reduces your learning curve 
and allows you to choose the design envi- 
ronment that you prefer. Your design flow 
is essentially preserved, with only a simple 
extra step required after synthesis. 

Once you have synthesized your design, 
the resulting netlists and constraints are 
passed as inputs to the PALACE program. 
The PALACE tool processes the netlists 
together with any design constraints. Upon 
successful completion, the PALACE engine 
will generate an optimized netlist and a 
constraint file. You then use the optimized 
netlist and constraint file as you normally 
would in the rest of your design flow. 
Figure 1 shows how the PALACE solution 
fits seamlessly into your design flow. 


Physical 
Constraints 


Figure 1 — PALACE design flow 
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You can control PALACE operations in 
the standard flow by a simple four-level 
effort option. 

The effort level determines how aggres- 
sively the PALACE tool will try to optimize 
your design, with the first level optimizing 
for minimum area utilization and the 
remaining levels optimizing for maximum 
performance. The default level is the third 
level, and it is the initial recommended 
level when performance is an issue. 

After specifying the effort level, no fur- 
ther user intervention is required. Using 
PALACE technology in this flow provides 
you with the advantages of physical synthe- 
sis in a simple push-button automated 
flow, and can meet most realistic design 


requirements. 


Guided Flow 
You can use the PALACE tool in the guid- 
ed flow for Virtex-II devices to achieve the 
incremental improvements needed to 
meet your design requirements. In this 
flow, you first generate an NCD (native 
circuit description) file by placing and 
routing your design. This NCD file is 
then converted to XDL (Xilinx Design 
Language) format with the Xilinx XDL 
utility and passed as an input to the 
PALACE program, along with the design 
constraint file. The PALACE tool opti- 
mizes the paths that do not meet timing 
requirements to help you achieve timing 
closure. As with the standard flow, an 
optimized netlist and constraint file is 
produced that you use in the rest of your 
design flow. 

You can control PALACE operation in 
the guided flow by a simple two-level effort 


Category (Design) 


DSP (DES) 

Microcontrollers (uP1232a) 
Communications (Reed-Solomon decoder) 
Bus Interfaces (I?C Master) 


State Machine and Control Logic (Arbiter) 


Spartan-3 Device 


option, which determines how aggressively 
the PALACE program will try to optimize 
your design. As with the standard flow, no 
further user intervention is required. Using 
PALACE physical synthesis in the guided 
flow provides you with the capabilities of 
an advanced physical synthesis solution in 
a simple push-button automated flow. 
Thus, you can meet the “last mile” per- 
formance requirements of your design. 


Achieving Best Results with PALACE 
Effective use of the PALACE physical syn- 
thesis solution can help you meet your 
design requirements quickly. With the 
standard and guided flows, you have two 
simple but powerful solutions that should 
be used together in order to achieve the 
best results. 

When you use the PALACE engine in 
either flow, you should always include tim- 
ing constraints in the input constraint file. 
Meaningful and accurate timing con- 
straints are important because they help the 
PALACE program to focus on the problem 
areas of your design, while allowing trade- 
offs with other non-critical areas. 

If you are particularly concerned about 
area utilization, you should use the first 
effort level on your initial run. Otherwise, 
you should use the default settings, which 
will automatically run PALACE physical 
synthesis in standard flow at a high- 
optimization effort level. 

After you have obtained the optimized 
results, you can analyze either the 
PALACE report file or the report from an 
implementation run to determine if the 
result meets your requirements. If you miss 
your timing targets by a wide margin 


Performance Gain 
(% Max Frequency) 


x(35200P0408-4 





Table 1 — Examples of PALACE performance improvements 
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(more than a few nanoseconds in Spartan- 
3 devices), you should try running the 
PALACE engine in the standard flow with 
a higher effort level. If you are using a 
Virtex-II device and have still not met 
timing requirements, you should try the 
guided flow to achieve the last nanosecond 
or so of required performance. 

The performance gains that you obtain 
with PALACE physical synthesis will vary 
depending on the type and complexity of 
the design. Table 1 shows performance 
improvements obtained when the PALACE 
solution was used in the standard flow for a 
few sample designs from a variety of cate- 
gories. In all of these cases, the design flow 
was identical for two runs, except that the 
PALACE solution was used before the 
implementation stage in the second run. As 
you can see, PALACE physical synthesis 
provides a substantial increase in perform- 
ance that allows you to move to a slower 
speed grade in many cases. 


Conclusion 

As FPGAs continue to increase in density 
and complexity, you need to ensure that 
your tools extract the maximum potential of 
the device’s architecture in the minimum 
amount of time. This is especially true for 
cost-sensitive design cycles that involve the 
low-cost Spartan-3 FPGAs. Without a phys- 
ical synthesis solution that can effectively 
exploit the architecture of your target device, 
you risk overrunning your forecasted. costs 
by having to move to a more expensive part, 
or by spending too many engineering hours 
trying to achieve timing closure. 

The Aplus PALACE tool is an 
advanced physical synthesis solution that 
fits seamlessly within your existing design 
flow for all Xilinx Spartan-II, Spartan-HE, 
Spartan-3, Virtex, Virtex-E, Virtex-H, and 
Virtex-II Pro™ FPGAs, providing you 
with a fully automated solution to help 
achieve your design requirements in the 
minimum amount of time. In fact, you 
may even be able to move to a smaller part 
or a slower speed grade and dramatically 
reduce your costs. 

To learn more about the PALACE phys- 
ical synthesis tool, e-mail info@aplus- 
dt.com or visit www.aplus-dt.com. &. 
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Seamless Co-Verification 
Accelerates Time to Market 


The Seamless co-verification tool trom Mentor Graphics has been customized 


fo provide an efficient debug methodology tor Virtey-II Pro Platform FPGAs. 


fo eo Pa 
owe Fe : 


by Milan Saini 

Technical Marketing Manager 
Xilinx, Inc. 

milan. saini@xilinx.com 


The growth in the system-on-chip (SoC) 
market is fueled by the need to increase 
performance and reliability while reducing 
overall system costs. 

With the introduction of the latest Xilinx 
Platform FPGAs, ASIC-style integration 
and performance is now available in pro- 
grammable logic. This means that FPGA 
designers need access to newer electronic 
design automation (EDA) tools and 
methodologies to maintain design efficiency 
and productivity. 

Hardware/software (HW/SW) co-verifi- 
cation is an example of one ASIC method- 
ology that has recently gained relevance for 
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platform FPGAs. In this article, I will intro- 
duce the concept of co-verification, and will 
describe the role, relevance, and realizable 
benefits in the context of programmable 
systems. I will also explain the actual devel- 
opment work we've done in cooperation 
with Mentor Graphics to extend existing 
HW/SW co-verification solutions to effec- 
tively target Xilinx devices. 


The Debug Challenge 

With as many as four IBM™ PowerPC™ 
405 processors, ultra high-speed serial I/O 
technology, and up to 10 million system 
gates, Xilinx Platform FPGAs represent a 
paradigm shift in programmable platforms, 
as illustrated in Figure 1. These devices 
offer a unique and unprecedented combi- 
nation of flexibility, performance, and inte- 
gration to widely expand the size, scope, 
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and range of applications that can now be 
deployed on FPGAs. 

Just as leading-edge process technologies 
have helped silicon to achieve amazing 
capabilities, design software must keep pace 
by delivering productivity-enhancing tools 
that make it easier to design and debug. By 
various accounts, design verification is the 
most serious bottleneck engineers face 
today in delivering multimillion-gate SoCs. 
In the case of ASICs, it is not uncommon 
for verification teams to spend as much as 
50% to 70% of their time and resources on 
the functional verification effort. 

In the particular instance where a 
processor is part of a design, the interface 
between hardware and software becomes 
an area of increased focus and attention. 
Validation that the hardware and software 
will function correctly together becomes a 
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Figure 1 — Paradigm shift in programmable platforms 


critical part of the overall debug process. It 
is therefore important that specialized tools 
and methodologies be developed that pro- 
mote debug efficiency and provide a 
streamlined approach to verification. 

For a while now, HW/SW co-verifica- 
tion has been quite commonly used to 
debug ASIC SoC designs. Now, with an 
increasing number of FPGA design starts 
involving embedded processors, this tech- 
nology is becoming meaningful and impor- 
tant for FPGA designers as well. 


Co-Verification for Platform FPGAs 

Graphics’ Seamless™ Co- 
Verification Environment (CVE) is the 
industry’s leading HW/SW co-verification 


Mentor 


tool — and now it supports the Xilinx 
Virtex-II Pro™ family of FPGAs. The 
basic concept behind co-verification is to 
merge the respective debug environments 
used by hardware and software teams into a 
single framework. 

For example, a logic simulator is made to 
communicate with a software debugger, 
enabling you to get simultaneous control 
and visibility into the internals of the proces- 
sor, as well as the hardware peripheral logic 
that surrounds it, as shown in Figure 2. 

An efficient co-verification tool can help 
uncover a range of HW/SW interface 


Platform for 
Programmable 
Systems 


Platform FPGAs 


PowerPé 


¢ Memory accessing and initialization 


problems 


¢ Memory map and register map 
discrepancies 


¢ Interrupt service routine errors. 


To realize the many benefits that co- 
verification has to offer, there are three 
prerequisites for the design under test. A 
co-verification design environment typi- 


cally involves the following scenario: 


1. The system has a processor executing 
software code that interacts with sur- 
rounding custom peripheral logic. 


2. There is extensive interaction between 
software and hardware parts of the 
design during execution. 


3. Both hardware and software engineer- 
ing teams agree on using co-simula- 


tion early in the design stage. 


Software SW Domain 


Debugger 


ey Si 


These prerequisites guarantee a smooth 
methodology flow and a common commu- 
nication medium between the two teams. 


Co-Verification vs. Simulation 

Seamless CVE advances the concept of 
“functional simulation” in traditional logic- 
only FPGA designs to “co-verification” in 
processor-based Virtex-I] Pro Platform 
FPGAs. This methodology establishes value 
for multiple design teams including hard- 
ware engineers (peripheral logic debug), 
embedded software engineers (application 
and firmware debug), and system designers 
(performance analysis and tuning). Let's dis- 
cuss some of the many advantages Seamless 


co-verification offers over simulation. 


Faster Performance 

Pure logic simulation can be used to simulate 
a design with a processor component. This is 
accomplished by including an RTL model of 
the processor to simulate the software code. 

This approach, however, is painfully 
slow and not adequate to address all but 
the most basic debug requirements. The 
overall simulation speed is generally in the 
sub-100 Hz range. 

Co-verification, on the other hand, is 
able to run simulation orders of magnitude 
faster. This speedup is achieved primarily 
through the use of clever tool optimizations 
and faster processor models known as 
instruction set simulators (ISSs). 

To understand the concept of optimiza- 
tion, note that the real bottlenecks in simula- 
tion are due to the accurate but slow logic 
simulators. Every time the software needs to 


HW Domain louie 


mulator 


en ee 
reConnect ¢——4 a 
us Model J} Peripherals 


problems, including: 
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(including RTOS boot) 


Memory 
Server 


10,000X Faster Access to Memory 





Processor and peripheral initialization Figure 2 — The Seamless Co-Verification Environment connects hardware and software verification 


and configuration problems environments for Virtex-II Pro FPGAs. 
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communicate with hardware, the transac- 
tion must go through a logic simulator. 
Because software runs on the ISS and it 
runs much faster, the software and ISS are 
always waiting for the hardware and logic 
simulator to catch up. 

The Seamless tool bypasses this funda- 
mental limitation by introducing the con- 
cept of a coherent memory server (CMS), 
shown in Figure 3. Using the CMS, the ISS 


is able to read and write to memory about 


be attached that enables interactive and 
graphical debugging capability. The stan- 
dard features of a software debugger include 
the ability to step through source code (C 
and assembly), set breakpoints, and observe 
register and memory contents. This sym- 
bolic debugger gives you greater control and 
comprehension than what you would get by 
trying to achieve similar goals using the 
combination of a logic simulator and an 


HDL processor model. 


Cycle, Pin 


Accuracy 
Desired? 





Figure 3 — The Seamless coherent memory server accelerates co-simulation with faster access to memory. 


10,000 times faster than if it had to go 
through a logic simulator. Given that 
processor-to-logic interaction is mostly 
through read-write cycles to memory — 
fetching instructions, accessing peripheral 
registers, and such — the overall simulation 
speed can be dramatically increased by 
diverting most CPU-to-memory transac- 
tions to run through the faster CMS 
instead of through the logic simulator. 
Only portions of code or regions of 
memory that are under active debug are 
run through the pin and cycle-accurate 
logic simulator. This means that the simu- 
lator bottleneck only factors in less than 
1% of the software-hardware transactions, 
providing a significant overall throughput 
advantage over pure RTL simulation. 


Increased Comprehension 
To efficiently address debugging problems 
that span multiple teams, you need tools 
and methodologies that each team can 
relate to. For example, debugging processor 
code on an RTL model of the CPU is 
inherently inefficient and impractical. 

In Seamless CVE, however, because a 
cycle-accurate ISS model replaces the RTL 
processor model, a symbolic debugger can 
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Support for Abstract Models 
Oftentimes, when very high data through- 
put is required to validate certain design 
functions, RTL models have to be replaced 
with faster, more abstract behavioral models. 
These high-speed models, usually written in 
C or C++, can interact with the ISS at very 
high speeds, allowing for complex protocols 
to be rapidly and comprehensively tested. 
The Seamless co-verification tool has 
been extended to be able to plug in these 
behavioral models through its “C-Bridge” 
interface technology. By working less in the 
logic simulator and more with higher-level 
models, verification speeds can deliver sig- 
nificant performance gains. With increased 
simulation throughput, the virtual platform 
now gives you visibility into system per- 
formance and architectural trade-off issues 
at a very early stage in the design process. 
You can not only quickly validate function- 
ality, but also analyze and tune important 
system attributes, such as bus bandwidth, 
latency, and contention — all leading to 


increased. system performance. 


Additional Co-Verification Benefits 
With access to co-verification technology, 
processor-based designs are not only easier 


to debug, but the process starts much earli- 
er in the design phase. 


Find Problems Earlier 
Design teams are highly motivated to iden- 
tify and fix problems at an early stage in the 
design cycle. A well-known design axiom 
states: “The earlier a problem can be iden- 
tified, the easier and cheaper it is to fix it.” 
Typically, designers cannot initiate soft- 
ware verification until a hardware prototype 
is available. As a consequence, when soft- 
ware verification occurs in a serial manner, 
HW/SW interaction problems may not be 
detected until much later in the design stage. 
A virtual prototyping and debug envi- 
ronment removes this restriction by 
enabling product integration ahead of 
board and device availability, or even before 
the final design is committed. With the 
Seamless CVE, software teams do not have 
to wait for silicon before they can start 
developing and testing their portions of the 
design. As a result, problems can be found 
earlier and the time to working silicon is 
dramatically reduced. 


Simplified Test Benches 

To verify design functions, hardware engi- 
neers often write elaborate HDL test bench 
routines. These test benches can become 
very complex, and it is not uncommon for 
the test bench code size to approach that of 
the design itself. With co-verification, the 
ISS processor model allows test benches to 
be greatly simplified. 

For hardware verification engineers test- 
ing protocols and device drivers, test bench- 
es are simplified, because actual embedded 
software code — and not contrived test bench 
code — is driving the hardware circuits. 

Similarly, software engineers do not 
have to resort to writing stub code. Actual 
hardware devices provide real-life responses 
to calls made to hardware. Overall, this 
leads to fuller and more comprehensive test 
coverage, leading in turn to increased con- 
fidence in the working of the design in sil- 


icon in the first instance. 


Greater Runtime Control 
An important attribute of debugging in the 
virtual domain is the ability to “stop time.” 
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Figure 4 — Customized Seamless enhancements to support Virtex-II Pro devices 


As a result, it is easily possible to simulta- 
neously observe and modify the internal 
values of the CPU registers, as well as those 
of the hardware device registers with which 
the processor is communicating. This abil- 
ity to freeze and synchronize the hardware 
and software domains offers the ultimate in 
control and observability — and it is invalu- 
able in efficiently helping debug complex 


and intricate transactions. 


Virtex-Il Pro Enhancements 

In supporting Xilinx Virtex-II Pro FPGAs, 
the Seamless tool is able to largely leverage 
off its existing support for the IBM 
PowerPC 405 core. However, subtle differ- 
ences in Virtex-II Pro devices have required 
customized enhancements to the IBM 405 
Processor Support Package (PSP) and the 
Seamless CVE product to provide you with 
an efficient and easy out-of-the-box experi- 
ence, as shown in Figure 4. 

First, the PowerPC 405 ISS model from 
Mentor Graphics had to be matched with 
the Xilinx version of the PowerPC 405 core, 
which resolved some of the pin differences. 

Additionally, Virtex-II Pro devices have 
dedicated on-chip memory (OCM) con- 
troller circuitry that, in the actual silicon, is 
tied to the OCM port of the PowerPC 405 
core. The same connection had to be 
stitched in the Seamless environment to 
the PowerPC 405 ISS. 

Xilinx embedded memory blocks (block 
RAMs) also had to be specially coded to be 
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compatible with the Seamless coherent 
memory server. This was accomplished by 
inserting special hooks into the HDL 
memory models for the block RAMs. 

Finally, a conscious attempt was made to 
ease the learning curve for engineers already 
familiar with Virtex-II Pro Development 
Kit (V2PDK) flows. Three of the reference 
designs included in V2PDK were ported to 
the Seamless environment. This allows you 
to not only get a jump-start in understand- 
ing co-verification flows, but also the abili- 
ty to compare and contrast co-verification 
with pure logic simulation. 

All these enhancements are rolled into a 
customized PSP that represents the final 
integrated module, which can be used with 
the standard Seamless kernel. 

In addition, a special Xilinx-only version 
of the Seamless CVE product has been intro- 
duced. This bundle includes the Seamless 
kernel and the PSP, and it is designed specif- 
ically for the Virtex-II Pro environment. 

To prepare a design to run in the 
Seamless co-verification environment, the 


following steps must be taken: 


1. Instantiate the Seamless PowerPC 
bus interface models into your 
Verilog™/VHDL design (replacing 
the PPC Swift model). 


2. Provide the Seamless tool with access 
to the “Seamless-ready” block RAM 
memory models. These models mirror 
the HDL models but are enhanced 


with API hooks, which allows them 
to communicate with the coherent 


memory server. 


3. Input the systems memory map into 
the Seamless graphical user interface 
and define startup options for the 
hardware simulator. 


4, Load the compiled software executable 
(elf with -gdwarf) into the software 
debugger and start the session. 


Programmable Logic 

Complements Co-Verification 

The chances for first-time success with 
your design are greatly increased by early 
integration and testing in the virtual proto- 
type domain. 

However, there are classes of problems 
involving behavior that can only be cap- 
tured when the processor runs at full speed. 
In this regard, platform FPGAs serve as a 
perfect complement to virtual platform 
debug techniques. Designs can be down- 
loaded into FPGA silicon for validation at 
full system speeds. 

If problems escaped earlier attention, 
you can debug in-system with the Xilinx 
ChipScope™ Pro interactive logic analyz- 
er, or you can go back to the co-verification 
environment for a more controlled analy- 
sis. Design errors can be fixed and re- 
implemented in silicon without incurring 
the huge delays and costly mask re-spins 
common with ASIC design flows. 


Conclusion 

The current generation of Xilinx Platform 
FPGAs with powerful RISC processors 
and multimillion-gate capacities requires 
powerful and matching co-verification 
methodologies. With the availability of 
Mentor Graphics’ Seamless CVE, you now 
have access to an ASIC-strength, best-in- 
class debug solution. 

Seamless CVE provides an efficient and 
easy-to-use methodology that can integrate, 
verify, and debug hardware and software 
interactions very early in the design cycle — 
thus preserving and enhancing the critical 
time-to-market advantage of FPGAs. To 
learn more about Seamless co-verification, 


go to www.mentor.com/seamless/fpgal. ¥& 
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In FPGA Synthesis, 
Timing Is Everything 


By optimizing its timing-driven synthesis tool forSpartan-3 devices, 


the Synplicity Synplify Pro FPGA synthesis tool:makes it possible 
fo meet timing constraints and deliver significant aréa’savings. 





by Steve Pereira 
Technical Marketing Manager 
Synplicity, Inc. 
stevep@synplicity.com 


Xilinx Spartan™-3 devices let you take on 
a new set of applications once reserved for 
the ASIC domain — and complete those 
applications quickly and cost effectively. 
Closing timing is a challenge on large 
devices; however, Synplicity® has made the 
leap to the next performance level. 

The Synplicity timing-driven synthesis 
solution has been optimized for the 
Spartan-3 family. It provides optimal 
implementations of your HDL to increase 
performance, reduce area, and drive down 
the solution cost. 


Best of Both Worlds 

The Synplify Pro® FPGA synthesis tool is 
truly timing driven. Critical paths are 
speed-optimized to meet timing con- 
straints. Non-critical paths are optimized 
for area. Once your timing constraints are 


met, the Synplify Pro tool turns its atten- 
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tion to area optimization. This allows you 





to meet your timing goal in the smallest 
device possible. Timing-driven synthesis 
delivers huge area savings and still meets 
timing constraints. 

Figure 1 shows a graph of requested and 
actual maximum operating frequency 
(Fmax) for the place-and-route (PAR) 
results of a design run through the 
Synplify Pro synthesis tool. The tool’s esti- 
mates follow the requested frequency line 
until other critical paths start appearing 
that the tool cannot optimize. At that 
point, Fmax starts to degrade. 

Figure 2 shows the LUT (look-up table) 
count for the same design. The results 
underline the importance of true timing- 
driven synthesis. When the requested fre- 
quency is low, the LUT count is low. The 
LUT count only starts to increase when the 
requested frequency is raised. This perform- 
ance-on-demand feature can drastically 
reduce area while maintaining performance. 

The Synplify Pro tool can also reduce 
area by automatically mapping logic to 
dedicated resources within the Spartan-3 


architecture. The tool extracts ROMs, 
RAMs, SRL™ (scan ring linker) modules, 
and global control resources, among others. 
Extracting these logic modules increases 
performance and in most cases, decreases 
the number of logic gates in slices, which in 
turn reduces the LUT count. 


Three Ways to Win 

The Synplify Pro synthesis solution reduces 
Spartan-3-based design costs for three dif- 
ferent design scenarios: 


1. For designs with challenging perform- 
ance goals, the tool offers advanced 
logic optimizations to meet your 
requested frequency — and still allows 
you to choose a slower speed-grade 
device. Switching to a lower speed- 
grade device can drastically reduce 
costs on high production runs. 


2. For designs with both performance 
and device-size goals, the Synplify 
Pro engine provides performance on 
demand. As Figures 1 and 2 illus- 
trate, when requested performance is 
non-taxing, the LUT count remains 
very low, which lets you choose a 


smaller device. 


3. When modifications are required to 
an existing FPGA in a system, the 
cost can be high. In particular, when 
modifications fail to place or route 
properly, the most likely option is a 
complete redesign at enormous cost. 
The Synplify Pro tool reduces the 
probability of modification failure 
with its performance benefits and 
performance-on-demand area savings. 
It is much easier to route a less full 
device. Because the Synplify Pro 
engine can meet the performance 
by utilizing a small device, there is 
a higher probability of the modified 
design being successfully implemented 
with the low utilization. 


Mapping to Spartan-3 without TBUF 

The Synplify Pro tool provides simple con- 
version of tri-state bus drivers (TBUFs) to 
multiplexers (MUXs) for mapping to 
Spartan-3 devices that do not have TBUFs. 
For each net with multiple tri-state 
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Figure I — Place-and-route frequency results meet design goals. 
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Figure 2— LUT count can be traded for frequency. 


drivers, the tool creates a parallel MUX. 
For example, if the Verilog™ design code 
is shown as follows: 


ni =e1 ?d1: 1’bz; 
ni =e2 ? d2: 1’bz; 
ni =e3 ?d3: 1’bz; 
ni =e4 ?d4: 1’bz; 


then the conversion procedure in the 
Synplify Pro tool would create the follow- 
ing Verilog code: 

ni =e1 &d1|e2&d2|e3 &d3| e4 & d4 
— this is the tool’s pmux structure. 

If the conversion is done very early in the 
flow, when most MUX-based optimizations 
are necessary, this logic and the Synplify Pro 
tool deliver an optimal implementation. 
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For example, if the following code is true: 


e1 = !s0 & !s1; 
e2 =s0 & !s1; 
e3 = !s0 & 1; 
e4=s0 &s1; 


then the logic driving n1 will be imple- 
mented using 4x1 MUX with optimal use 
of MUXs. The Synplify Pro tool also sup- 
ports more complex conditions. If n1 is 
driving a primary output, appropriate 
logic would be generated so that the signal 
outside the chip shows a high impedance 
when e1 = e2 = 63 = e4 = 0. Single tri-states 
driving primary outputs are sucked into 
the pads. Internal single tri-states are con- 
verted to logic. 


ASIC to FPGA Migration 

Because Spartan-3 devices have a very high 
gates-per-dollar ratio, ASIC designers are 
increasingly adapting to FPGA solutions. 
In doing so, they also reap the benefits of 
reprogrammability and zero NRE (non- 
recurring engineering) costs. 

Among the drawbacks of the ASIC-to- 
FPGA flow are coding structure and 
instantiated components. Synplicity has 
been providing a solution (the Certify® 
product) for ASIC prototyping in an 
FPGA that addresses both of these issues. 
But for generic ASIC code, the biggest pit- 
fall is gated clocks. 

With the release of Version 7.3, the 
Synplify Pro FPGA synthesis tool can auto- 
matically convert the gated clock structures 
commonly used in ASICs so that they map 
to clock enables in Xilinx devices while 
making use of global routing resources. 
This feature has two primary benefits for 
ASIC/FPGA designers: 


1. Gated clock conversion eases the pain 
of migrating ASIC code. 


2. The tool automatically provides 
higher performance and better uti- 
lization, because global routing 
resources are used instead of internal 


high fanout nets. 


Save Money on Your Next Spartan-3 Design 
The performance of the Synplify Pro solu- 
tion, coupled with industry-leading 
FPGAs from Xilinx, gives you the ability 
to meet aggressive performance goals on 
time and on budget. 

To test the Synplify Pro tool and see the 
cost savings for yourself, download the 
Synplify Pro tool at www.synplicity.com/ 
downloads/download1.html. 

When informed that you do not have 
a license, follow the subsequent instruc- 
tions and send the information to 
license@synplicity.com. Synplicity will send 
you a temporary license immediately. 

Also, to be sure you get the most from 
the Synplify Pro FPGA synthesis solution, 
download and read “Benchmarking 
Synplify and Synplify Pro Software” at 
www.synplicity.com/literature/pdf/ 
benchmarking_synplify.pdf. & 
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~Fytends PCI Express 


Advanced Switching supports flexible interconnects, distributed computing, 
and multicast options. Teamed with Xilinx FPGAs and RocketlO transceivers, 
it will dominate nextgeneration embedded computing and communications platforms. 


by Kiran S. Puranik 


Staff Design Engineer 

Xilinx, Inc. 

kiran.puranik@xilinx.com 

The PCI Special Interest Group’s 
(PCISIG) ratification of the PCI 


Express™ Base specification in July 2002 
was a major milestone in the widespread 
adoption of serial system interconnect 
architectures. Now, the combination of 
Xilinx Virtex-IJ Pro™ Platform FPGAs, 
RocketIO™ multi-gigabit transceivers, 
and embedded IBM PowerPC™ 405 
processors is making high-performance 
PCI Express designs easier and faster. 

PCI Express is suitable for a wide 
range of computing and communications 
applications because it is a low-cost, 
high-performance, interoperable, and 
scalable solution. An important adjunct 
to the base standard is the Advanced 
Switching (AS) technology. It enhances 
the base standard to address the needs of 
the communications infrastructure for 
the next decade. 

Single-chip AS solutions for server, 
storage, and communications applications 
can be configured with a Virtex-II Pro 
Platform FPGA, on-chip RocketIO multi- 
gigabit transceivers, and embedded IBM 
PowerPC 405 processors. The extensive 
Xilinx intellectual property (IP) offerings — 
including the world’s first Real-PCI™ 
Express solution — will accelerate product 
design cycles and time to market. 
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PCI Express Architecture 
PCI Express is a layered architecture con- 
sisting of physical, data link, and transac- 
tion layers. Device drivers and application 
software constitute upper layer protocols 
(ULPs). 

PCI Express Base features include: 


¢ LVDS, 2.5 Gbps, serial links scalable 


up to 32 lanes in each direction 


¢ Embedded clock and 8b/10b transmis- 


sion code 


¢ Native hot attach/detach capability for 
high-availability applications 


¢ Robust and efficient link protocol 


* Classes of traffic and support for 
isochronous applications 


¢ Packet-based protocol and credit-based 
flow control per virtual channel 


¢ Link level and end-to-end data integrity 
(CRC-32) 


¢ Completely in-band signaling support. 


Advanced Switching 

Advanced Switching leverages the physical 
and data link layers of the PCI Express Base 
specification. Communication and embed- 
ded computing enhancements are added at 
the transaction layer to address chip-to- 
chip, backplane, and inter-chassis data 
communication requirements. The AS fab- 
ric architecture is designed for both control 
and data-plane switching applications. 





AS components can be broadly classi- 
fied as being either switches or end sys- 
tems. Figure 1 shows the AS protocol 
stack on an end system. The transaction 
layer creates an interface between ULPs 
and the data link layer. It serves as the tun- 
nel for ULP encapsulation and extraction 
at an end system. 

The AS transaction layer packet is 
shown in Figure 2. It contains an AS route 
header followed by a payload section. 
Ingress end systems provide route and 
encapsulation payload, while egress end 
systems extract payload. In order to per- 
form packet switching, the AS route head- 
er is completely agnostic to the contents of 
the payload section. 

An entire AS transaction layer frame is 
shown in Figure 3. The physical and link 
layer portions of the frame are added and 
removed at each hop between fabric 


components. 


Important Features of AS 

Legacy PCI platforms require strong 
parent-child relationships between con- 
nected components. Tree topologies are 
necessary for hardware and software com- 
patibility. In contrast, AS fabric system 
topologies can be described as a graph of 
connected switches and end systems. 
Switches constitute internal nodes of this 
graph, providing interconnects with other 
switches and end systems. End systems, 
on the other hand, are the edge nodes, 
representing data ingress and egress 
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points. The ability to support 
many topologies gives platform 
architects enormous flexibility 


regarding placement of critical ULPs 


resources. 


Supports Distributed 


Computing Architecture LAYER 3 

AS enables distributed processing ie) 

systems, which results in multiple 

memory-address domains within are 

a platform. This is in sharp con- 

trast to the single flat address LAYER 1 

domain found in legacy PCI plat- (tI) 

forms. AS platforms allow load- 

store and messaging protocol 

interactions between concurrent 

hardware and software processes 

on end systems. This facilitates 8 

peer-to-peer-based applications BYTES 

commonly found in server, stor- 

age, and communications arenas. 

Distributed processing offers bet- 4-40%6 
BYTES 


ter scalability and ultimately a 
lower cost of operation. 


Path-Based Unicast, Multicast, 
and Broadcast Packet Routing 
Unicast packet switching is based 
on path information embedded in 
the AS route header and takes the 
form of a turn pool. Path routing 
requires no up-front program- 
ming of switches. Switches simply 
look at a packet’s route header to 
determine the egress port. This 
simplifies switch design and plat- 
form configuration enormously. A 
packet’s forward route turn pool 
can also serve as a backward route 
to the source. Completions for 
read requests and event notifications to a 
packet’s source end system are examples of 
backward-routed packets. 

The multicasting feature allows an end 
system to target a packet to multiple end 
systems. A multicast group index is carried 
on each multicast packet’s route header. A 
multicast group uniquely identifies a set of 
switch egress ports for each switch on a 
multicast packet’s path. At a switch, a mul- 
ticast group table is found in a lookup table 
using the packet’s multicast group index. 
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Figure 2 — AS transaction layer packet 















levels to applications. End systems 
must operate within prescribed band- 
width limits by metering the rate of 
data transfer. 

Congestion in AS fabric can be 
caused by unexpected transient 
events such as component failure, 
accidental removal, or errant end- 
system behavior. Congestion causes 
packets to suffer excessive latencies, 
resulting in a loss of expected levels 
of service. 

AS congestion management miti- 
gates fabric congestion and maintains 
QoS levels. Congestion is detected at 
switches; when detected, Backward 
Explicit Congestion Notification 
(BECN) messages are sent upstream 
to end systems contributing to con- 
gestion. End systems respond to 
BECN messages, reducing the rate of 
data injection by a specified amount. 

Congested packets may also be 
marked with a Forward Explicit 
Congestion Notification (FECN) 
bit. FECN notifies downstream end 
systems of congestion on a certain 
path. Some AS applications may 
also set a discard bit on AS route 
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headers, allowing downstream 
switches to selectively discard pack- 
ets to alleviate local congestion. In 


the absence of BECN messages, end 
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Figure 3 — AS transaction layer frame 


The packet is then replicated on each port 
contained in the multicast group. 

When a switch detects a broadcast route 
header on a packet, it is required to repli- 
cate it on all ports except the packet’s 
ingress port. 


Congestion Management 

and Quality of Service 

Bandwidth provisioning is an important 
aspect of AS fabric management because it 


helps guarantee Quality of Service (QoS) 


5 BYTES 


systems restore normal traffic flow 
in specified increments. Fabric man- 
agement may use congestion notifi- 
cations to initiate corrective action 
and activate built-in fail-over using 
alternate paths. 


Differentiated Classes of Service 
Class of Service (CoS) mechanisms reduce 
the complexity of maintaining QoS by 
mapping multiple traffic flows into a few 
service levels. AS fabric resources are allo- 
cated based on as many as eight service 
levels called Traffic Classes (TCs). Traffic 
flows are aggregated and forwarded by 
fabric components based on the TC of 
these packets. 

Within a TC, AS fabric preserves the 
ordering of packets end-to-end, with the 
exception of those marked as bypassable at 
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the source. There is no such ordering 
requirement across TCs. AS compo- 
nents map TCs to Virtual Channels 
(VCs) corresponding to hardware 
channels within the components. AS 
components must implement at least 
two VCs. All AS link partners must 
share the same TC to VC mapping, 
downshifting if necessary to the small- 
est common number of VCs supported 
between the two link partners. 

Each AS VC contains two inde- 
pendent queues — the main queue and 
a bypass queue. AS link flow control 
manages flow credits for each queue 
A packet 


independently. marked 


bypassable must enter the bypass queue 

if it causes VC head-of-line (HOL) block- 
ing. The bypass queue is serviced as soon as 
bypass credits become available. This capa- 
bility is compatible with legacy bus proto- 
cols that require write transactions to pass 
HOL-blocked read transactions, to avoid 
possible deadlocks. 


Multi-Protocol Support 
Protocol Encapsulation Interfaces (PIs) rep- 
resent fabric management and application- 
level interfaces to the AS fabric. Table 1 
details a list of currently supported Pls. Pls 0- 
7 represent fabric management interfaces, 
while PIs 8-254 are application-level inter- 
faces. As shown in Figure 4, AS supports the 
tunneling of virtually any protocol. This 
makes AS platforms modular and cost-effec- 
tive, as well as easy to deploy and support. 
Application PI implementations man- 
date an efficient encapsulation, without 
losing a tunneled protocol’s semantics, 
which enables effective extraction at the 
fabric’s egress. A PI may be custom- 
tailored to suit a specific requirement or 
can tunnel standard protocols to support a 
broad range of applications, such as SPI, 
ATM, Ethernet, or TDM. 


Support for Segmentation and 
Reassembly 

The Maximum Payload Size (MPS) of the 
AS platform is the least common denomi- 
nator of MPS supported by all components 
within the platform. All PIs must restrict 
AS Transaction Layer Packet payload size to 
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Figure 4 — AS tunneled protocols 


platform MPS. End systems that need to 
encapsulate larger-than-MPS protocol 
packet sizes must split these into sub-MPS- 
sized segments. This also requires keeping 
track of multiple segments of the original 
packet and reassembling them at the fabric 
egress end. AS supports a standard mecha- 
nism to perform segmentation and 
reassembly (SAR) via the SAR PI. 
Individual PIs may choose other SAR 


implementations. 


PI Index Protocol Encapsulation Identity 


Fabric Discovery 


Congestion Management 
Segmentation and Reassembly 
Configuration Management 
Fabric Event 

Reserved 


Reserved 


Wie 


Table 1 — AS protocol encapsulation interfaces 


ATM 


Xilinx Real-PCI Express 

The Xilinx Real-PCI Express solution 
is a combination of two leading tech- 
nologies: the PCI Express specification 
and. Virtex-II Pro FPGAs. It provides 
user-configurable options, excellent 
flexibility, and Xilinx Smart-IP™ tech- 
nology guarantees critical timing. Key 
benefits include: 


Availability 

The world’s first PCI Express solution is 
available for download today. It enables 
your compute and communication sys- 
tems to achieve the highest level of per- 
formance using serial I/O technology. 


Performance 

The RocketIO 3.125 Gbps-capable trans- 
ceivers on Virtex-I] Pro FPGAs enable 
multiple 2.5 Gbps lane implementations 
on a single chip. 


Flexibility 

The inherently programmable nature of 
FPGAs allows you to continually tune your 
design to changing platform performance 
and functional requirements, reducing 


your risk in adopting the standard. 


Faster Time to Market 

Today, there is simply no easier way to 
develop PCI Express applications with 
minimal impact to your overall system 
development cycle. Xilinx will provide 
updates for any specification changes. 


Conclusion 

Advanced Switching architecture addresses 
all of the major requirements for next- 
generation system interconnect solutions, 
such as scalability, expandability, modu- 
larity, high availability, and peer-to-peer 
capability, with built-in QoS and CoS 
support. 

Xilinx Virtex-II Pro Platform FPGAs, 
together with the Real-PCI Express solu- 
tion, are well-positioned to provide com- 
pliant Advanced Switching solutions, 
enabling rapid product deployments. 
More information about the Xilinx Real- 
PCI Express solution may be found at 
www.xilinx.com/pciexpress. & 
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Get the Platform Flash PROMise 


Reduce your costs and minimize board space for any Xilinx FPGA design. 


by Rob Schreck 

Senior Marketing Manager, Product Solutions 
Xilinx, Inc. 

rob.schreck@xilinx.com 


Designing with FPGAs often requires two 
different types of configuration PROMs. In 
the design and test phase, in-system pro- 
grammable (ISP) PROMs offer maximum 
flexibility for design changes, while one-time 
programmable (OTP) PROMs are used for 
manufacturing once the design is complete. 
You may, however, have to change your 
board design to accommodate higher den- 
sity, lower cost OTP PROMs. This prob- 
lem becomes more acute if you are using 
high-density FPGAs that need as many as 
10 ISP PROMs and five OTP PROMs. 
Xilinx has a solution with the new 
Platform Flash PROM family, which can 
configure any Xilinx FPGA — most with 
just one PROM. You get low-cost, in- 
system programmability, so you no longer 
have to change from one PROM to 
another to reduce production costs. See 


Table 1 for details. 


Economical and Flexible 

The December 2004 100K-unit price of 
Platform Flash PROMs is projected. to be 
$1.05 for the 1 megabit (Mb) version 
(XCFO1S) and $10.40 for the 32 Mb ver- 
sion (XCF32P). This not only represents a 
dramatic price decrease over previous gen- 
erations of configuration ISP PROMs, but 
for OTP PROMs as well. 

Engineers know that they need to repro- 
gram FPGAs during logic design and testing, 
and Platform Flash PROMs offer flexible in- 
system programmability to ease develop- 
ment. However, don’t forget that flexibility 
can be used at other times in the product life. 
For example, during manufacturing, you can 
program the PROM to test the board, and 
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then reprogram the PROM with the final 
FPGA design. You increase quality while 
decreasing manufacturing costs. 

In-system programmability also lets 
you reprogram the FPGA remotely to add 
new features or fix problems. This added 
dimension of flexibility means you reduce 
maintenance and field repair costs, plus 
you keep your customers happier. 


Big Functionality in Small Packages 

The Platform Flash PROMs come in a vari- 
ety of densities in two packages. The 1 Mb, 
2 Mb, and 4 Mb density PROMs are offered 
in the very small (6.4 mm x 6.5 mm) VO20 
TSSOP package, the smallest package area 
per megabit in the industry. These versions 
offer serial configuration for FPGAs and a 


XCFOIS 


XCFO2S 


XCF04S 


very low-cost solution for 
Spartan™-IIE, Spartan-3, and 
Virtex-I] Pro™ FPGAs. 

The family is also available in 
8 Mb, 16 Mb, and 32 Mb densi- 
ties, which come in a small (8 
mm x 9 mm) FS48 thin flat ball 
grid array package. These versions 
offer both serial and parallel con- 
figuration, and are well suited for 
higher density Spartan-3 and 
Virtex-II Pro FPGAs. 

To reduce board space and costs, you 
can pack on average 50% more bits into 
your configuration PROMs using the com- 
pression capability in our higher density 
packages. You can use one low-cost PROM 
to configure multiple FPGAs. You can even 
put multiple programs on one PROM and 
change the FPGA program on the fly. 


Conclusion 

Now you can use one PROM family to con- 
figure all of your FPGAs, reducing your 
manufacturing and inventory costs. The 
Platform Flash PROM family offers you 
ultimate flexibility at a very low cost. For 


more information, visit www.xilinx.com/ 


product/platformflash/. & 
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Performance + Time 


By approaching FPGA designs as 


three-dimensional endeavors, 


you can radically reduce 
device size — and cost. 


by Ken Chapman 
Senior Staff Engineer 
Xilinx, UK 
ken.chapman@xilinx.com 


“Performance + Time = Memory” may 
sound like an odd formula, but when you 
understand it, you can realize significantly 
lower implementation costs within Xilinx 
FPGAs. In this article, Pll show you how to 
use three-dimensional (3-D) design to 
accomplish a 15X reduction in the number 
of logic blocks in a sensing application. 

Although vital for DSP applications, I 
really like the way the formula can be 
applied to so many designs. It is particular- 
ly useful for applications that are suited to 
the range of Spartan™ devices, where 
cost savings are always welcome for high- 
volume applications. 

But let’s understand the formula first. 


2-D Parallel Design 
In most hardware designs, we treat the 
Xilinx FPGA as a two-dimensional (2-D) 
fabric, as shown in Figure 1. Complex logic 
blocks (CLBs) provide the logical functions 
and blocks of RAM are used for buffers, 
such as first-in first-out (FIFO) memories. 
The tendency is for a design to become 
larger as more functionality is required. 


24 Xcell Journal 


Columns 











Rows 





Figure 1 — Two-dimensional designs consider only logic and memory. 


Therefore, it must use larger devices. The 
clock speed can often be well below 100 
MHz, and many of the functions are 
clock-enabled at even lower rates. 

Because the cost of a design is propor- 
tional to the size of the device, parallel 
implementations, even if well optimized, 
will be relatively expensive. They cannot 
be avoided where maximum performance 


is required. 


Applications such as bus interfaces that 
need a predefined number of pins and clock 
rates are also fundamentally constrained 
in the way they can be implemented. 
However, when processing functions need 
only be completed in a relatively long time 
period, such 2-D design is wasteful and 
unnecessarily expensive. 

A parallel design provides logic for 
each and every function that must be 
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implemented. This means that there is 
actually a zero requirement for memory, 
because a signal (wire) exits for every value 
to be calculated. The addition tree example 
in Figure 2 shows how the value 
“A+B+C+D” is created. Because the value is 
immediately applied to the final adder, how- 
ever, the value does not need to be stored. 

Of course, the parallel implementation 
offers the very highest performance. The 
adder tree can easily exceed 100 MHz in a 
Spartan-II device, which is equivalent to 
more than 700 million additions per sec- 
ond. However, such a structure cannot 
benefit from having more time to complete 
the required operation, other than consum- 
ing less power if it is clocked slower. 

If there is 1 ms available to perform the 
addition tree, then it can be clocked at 1 
KH. It will work, but it really is a waste of 
the Spartan-II silicon performance poten- 
tial. Even worse, the more values that need 
to be added, the larger the circuit becomes — 
and this increases the cost of your product. 


Processors Obey 3-D Formula 

Now, take a closer look at the familiar world 
of processors. A processor is a very good 
time-sharing engine. The ALU is directed 
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Figure 2 — Addition tree example 
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Figure 3 — ALUs can be time-shared. 
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to perform many different operations over 
many clock cycles to (it is hoped) complete 
the desired process in the required time 
period. The higher the performance of the 
processor, the faster the ALU will be 
clocked, and hence, the more that ALU can 
be time-shared to achieve the algorithmic 
process, as illustrated in Figure 3. 

For example, given that a particular 
process must be completed in a maximum 
time of 1 ms, the number of clock cycles 
available for the processor to exploit 
depends on the performance: 


¢ A clock speed of 1 MHz provides 
1,000 clock cycles per 1 ms. 


¢ A clock speed of 100 MHz provides 
100,000 clock cycles per 1 ms. 


¢ A clock speed of 200 MHz provides 
200,000 clock cycles per 1 ms. 


This is all very obvious, but less obvious 
is the direct link this has to memory. 

Suppose the available clock cycles are 
used by the ALU to perform the trivial task 
of summing data values. In a 1 ms time 
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period, a 1 MHz clock rate means that the 
processor has the ability to sum 1,000 data 
values. It will have to get these values from 
somewhere, and that place will be memory. 
As the clock increases to 200 MHz, it can 
then use the same amount of logic to sum 
200,000 data values — and it now needs a 
memory to hold 200,000 words. 

In a more realistic case, a process tends 
to apply multiple instructions to each data 
set, so the memory requirement to store 
data is not so high; all the same, there is a 
very strong relationship. 


3-D Sequential Design 

Making the decision to operate the logic 
functions at a higher rate than the process- 
ing rate allows operations to be achieved 
sequentially. As with a processor, logic is 
time-shared over multiple clock cycles. 
Because “Performance + Time = Memory,” 
we also need to use memory to hold all the 
values not being used in a given clock 
cycle, as well as partial/temporary results 
created during the processing. See Figure 4 
for a 3-D rendering. 


















































Figure 4 — Logic can be time-shared over multiple clock cycles. 
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The FPGA can now be thought of as a 
3-D volume to be filled. The best part is 
that you just pay for the 2-D fabric being 
occupied. The only limits to “building” 
upwards are the maximum clock rate of 
the device and the amount of RAM avail- 
able in a given block. In addition to the 
dedicated blocks of RAM, each CLB can 
be used to provide distributed RAM, 
allowing the correct amount of memory 
to be allocated in each position. This pre- 
vents memory access bottlenecks from 
forming in your design. 


3-D Approach to Design 
When any function is implemented, two 
basic questions should be asked: 


1. How much time is available to 
complete the process? 


2. Given the performance of the 
selected Xilinx device, what clock 
rate will be used? 


The answer to the first question comes 
from the design specification. The way 
you partition the design into functions 
can have quite an impact, so consider 
some alternatives. As to the performance 
of Xilinx FPGA devices, this has more to 
do with “design comfort” than the actual 
peak performance of the devices. 

Regarding the second question, I per- 
sonally like to see devices clocked above 
75 MHz, and I find this relatively easy 
to achieve. However, the higher the 
clock rate is, the more challenging 
the design is. Anything lower than a 
50 MHz clock rate is very slow and 
wasteful of the performance potential 
offered by Xilinx logic devices. 
Remember that the embedded DLL 
(delay locked loop) and DCM (digital 
clock manager) blocks can be used to 
create internal clocks of a higher rate 
than those available on the PCB. 

The answers to the first two questions 
will let you know if there is any potential 
for time-sharing of logic resources. This 
leads to a third and final question: 


3. How can the memory resources of 
the device be utilized to reduce the 


size of implementation? 
Pp 
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64 Light Sensors 
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Figure 5 — Light sensors collect data for processing. 


Now the engineering starts. It does take 
some practice, so what follows is a design 
specification for you to consider. 


Design Challenge 

The challenge is to design a small box to be 
used in factories processing items such as 
fruits and vegetables. The card is used to 
collect data from light sensors located on 
various conveyor belts, along which the 
fruits and vegetables pass as they are sorted 
for type, quality, and size. 

The initial design concept is to employ a 
microcontroller (or similar small processor) 
to collate the information and communi- 
cate it via serial (RS-232) links to a PC in 
the factory control room. 

An FPGA is being considered to inter- 
face the processor to the sensors. The prod- 
uct is required in high volume (50K to 
100K units), so a Xilinx Spartan-II FPGA 
is the target for a cost-effective solution. 

The card supports 64 sensors. A logic “1” 
signal is generated when the light beam is 
broken by a passing object. The maximum 
speed of the conveyor belt is 1 meter/sec. 
The minimum width of a single item is 3 
cm, and there is a minimum 10 cm between 
items on the belts. 

Each pulse is recorded by a separate 
counter, which can support a maximum 
value of 4,095 (12 bits). A simple interface 
to the microcontroller is then able to read 
the value of any of the 64 counters in the 
card by supplying a 6-bit address. 


Initial Observations 

Taking a very direct approach to the 
design, we could simply identify the need 
to implement 64 counters of 12 bits fol- 


lowed by a 64:1 data multiplexer. In fact, 
this is a direct representation of the block 
diagram shown in Figure 5. 

However, we need to apply some funda- 
mentally good engineering here, because we 
certainly wouldn't want to have 64 inde- 
pendent clocks in a design. Such a design 
would lead to very poor utilization of the 
device and have a high probability of unre- 
liable operation. The signal inputs really 
should be synchronized to a single internal 
clock, and then clock enables should be 


used with the counters. 


First Estimate 

Given a basic understanding of the device 
architecture, you can easily make an esti- 
mate of the device resources used. 


* Counters — Because each slice of an 
FPGA can implement a 2-bit counter, 
six slices are required to implement a 
12-bit counter. Therefore, a total of 384 
slices are required for all 64 counters. 
(Two slices form a CLB within the 
Virtex™ and Spartan-II FPGA families.) 


Multiplexer — Each slice contains two 
lookup tables and a dedicated multi- 
plexer (MUXF5), enabling a 4:1 multi- 
plexer to be implemented. However, 
each pair of slices within a CLB share 
an additional dedicated multiplexer 
(MUXF6), enabling a complete 8:1 
multiplexer to be implemented in two 
slices. Nine of these 8:1 multiplexers 
are required to construct a 64:1 multi- 
plexer, which then must be replicated 
12 times to support the data width of 
the counters. The total size of the mul- 
tiplexer is then 2 x 9 x 12 = 216 slices. 
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¢ Synchronizing Logic — At this stage, 
we have not designed the logic to cap- 
ture the input signals and synchronize 
them to the internal clock. For now, 
we will allow a slice per input (two 
flip-flops and some gates). This gives 
us a total of 64 slices. 


Based on these major building blocks 
of the design, our estimate is for 664 
slices. Thus, a Spartan-I] XC2S50 device 
is suitable with its 768 slices, providing a 
surplus of 104 slices to complete the 
processor interface. 

There are many ways to implement the 
“Performance + Time = Memory” formula — 
and we will look at just one. But as long as 
you can significantly lower the cost, you are 
well on your way to improving the profit 
margins on your own designs in the future. 

Remember, the target to beat is 664 
slices in a Spartan XC2S50 device, which 
was the result of a full parallel 2-D design. 


Implementing a 3-D Design 
We must begin our 3-D design process by 
asking the right questions that relate to the 


“Performance + Time = Memory” formula. 


10cm 


— 
100 ms 


How Much Time Is Available? 

Taking the minimum fruit size and mini- 
mum spacing between fruit passing on a 
belt at the maximum speed of 1 meter/sec, 
we derive the timing of the fastest pulses 
from a light sensor. 

We discover that the pulses are of a long 
duration and that the pulse rate is very low. 
In fact, the maximum pulse rate is less than 
8 Hz, which is very slow indeed. However, 
we must consider that there are 64 sensors 
to be monitored; we could be unlucky 
enough to have them all triggered at the 
same time. So, all 64 sensors must be serv- 
iced in a maximum of 30 ms, and the 
aggregate data rate is more like 500 Hz. 


What Performance Is Available? 

We know that a Spartan-II FPGA is the 
target architecture. This device is capable of 
operation above 100 MHz, so device per- 
formance should not limit us at all in this 
case. Although we want to get the most out 
of the silicon, there is no point overdoing it 
and burning power unnecessarily. In this 
case, it is better to work out the minimum 
clock rate required to process all 64 chan- 


3cm 


<—P 
30 ms 


Figure 6 — Minimum fruit size and spacing determine maximum pulse rate. 
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Figure 7 — Counter functionality is replaced by an increment function. 
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nels, and then tie this rate in with a suitable 
clock source on the PCB. 

Looking at the timing waveform shown 
in Figure 6, the pulse width caused by the 
smallest fruit breaking the light beam is 
the most demanding. We must guarantee 
that we observe each sensor at least once 
every 30 ms. 

If, however, the 64 sensors are observed 
and processed sequentially, rather than in 
parallel, then 30 ms divided by 64 is the 
maximum time that can be allocated to 
each sensor. This means that the minimum 
processing rate is 2,133 Hz. Obviously, 
this is still desperately slow, but it only 
emphasizes that “Performance + Time = 
Memory” must be a valid formula to be 
applied in this case. 


Replacing Counters with Memory 

We have “Time” and we have ample 
“Performance,” so now it is a case of work- 
ing out how to make the whole thing a 
sequential 3-D design. How can the mem- 
ory resources of the device be utilized to 
reduce the size of implementation? 

Because memory is used to hold data 
values, we must identify where the data is 
in the system. These may be complete val- 
ues or partial values, so we must have a 
good look through the block diagram and 
identify where the data values are. In this 
system, they are fairly obvious in that the 
counters each hold a value. In the parallel 
implementation, they are distributed 
across the 384 slices, forming the 64 coun- 
ters, but we want to consolidate them into 
a single memory. 

We can choose between distributed 
(CLB) memory and dedicated (block) 
memory, and we could really use either to 
form storage for 64 values of 12 bits. 
However, as the dedicated block RAM 
isn't required for anything else, let’s take 
that option. Configured as 256 words of 
16 bits, a single block provides more than 
adequate storage. 

The counter functionality is then 
replaced with a single increment function, 
as shown in Figure 7. A “count value” is 
read from the RAM, passed through the 
increment block, and then written back 
into the RAM at the same location. This is 
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Figure 8 — Processor can reset count values or set test values. 
















64 Light Sensors 


— Re 


Figure 9— A 64:1 multiplexer interfaces 
to the sequential engine. 


best organized as a two-cycle process, but is 
no issue given the “Performance + Time” 
that is available. 

Although we could selectively access the 
count values to be recorded as a correspon- 
ding light beam is broken, it is much easi- 
er to scan sequentially through all 64 count 
values and record only those which must be 
increased before the value is written back 
into the RAM. This reduces the address 
generation to a simple 6-bit counter. 

At this stage, we have replaced 384 
slices with one block RAM and just nine 
slices of logic (six for the increment and 
three for the address counter). This is a 
huge savings. Now, however, we must find 
ways to connect the inputs and outputs to 
this 3-D processing engine. 
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Eliminate the Data Multiplexer 


The parallel data multiplexer is simply not 
required in this design. We save 216 slices 
instantly because the count values are now 
held in one consolidated memory. The 
dual-port nature of the block RAM really 
makes it very easy to connect the external 
processor. 

As illustrated in Figure 8, the memory 
also offers the opportunity for the proces- 
sor to have a write mode to reset count val- 
ues or set test values. As with the parallel 
implementation, there is a risk that the 
processor will try to read a count value that 
is in the process of being modified. 
However, it’s very easy to allocate time for 
the recording process and time for the 
processor to read values. 


SS 
Read Data 


To Microcontroller 


— 
Read Address 






= 
Read Data 


To Microcontroller 





——— 


Read Address 


Although a clock rate of a few KHz is 
adequate for the processing, a clock of 2 
MHz (or similar clock rate associated with 
the microcontroller) would achieve a count 
value update scan in 64 ps, leaving nearly 
all of the 30 ms processing period available 
for the microcontroller to read or write 
count values. 


Connecting the Sensors 
At some point in all 3-D designs, the par- 
allel world must be interfaced to the 
sequential processing engine. This does not 
have to be difficult, and often a simple 
method is adequate, as seen in Figure 9. 
The counter used to access each count 
value from the RAM can be used to select 
the associated sensor via a 64:1 multiplexer. 
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64 Light Sensors. 
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Figure 10 — 64-bit parallel-to-serial converter replaces multiplexer. 


Although this requires multiplexer logic, it 
is just for one bit and therefore only 
requires nine slices. 

Each sensor still requires its own logic. 
This is partly to synchronize the input 
signals, but is also required to ensure that 
each “beam broken” pulse is only used to 
record a count value once. For this rea- 
son, the one slice per sensor is unlikely to 
be reduced. 

When you see that the logic size is 
increasing because the function is becom- 
ing more parallel, it is worth looking to see 
if anything else can be time-shared and 
moved into memory. In this case, we can 
indeed improve things. 

We can replace the multiplexer with a 
64-bit parallel-to-serial converter (32 
slices), which converts the parallel domain 
into a serial sequential process, as demon- 
strated in Figure 10. To detect only the 
start of a new pulse, a memory is used to 
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XC2S50 
332 CLB 
0 Block RAM 


XC2S15 
22 CLB 
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remember the last state of each of the 64 
sensors. Because the operation is so pre- 
dictable, we can use the SRLIGE memory 
mode, which requires just two slices. 


Dramatic Cost Reduction 
So was it worth it? I think the diagrams in 
Figure 11 speak for themselves. 

To reduce the function from 332 CLBs 
to just 22 CLBs is a dramatic change: 15 
times smaller. Our design now fits in the 
smallest Spartan-II device (XC2S15) — and 
actually only uses 25% of that. 

This reduction in size and cost is not 
just specific to this particular design. For 
example, much of 3G wireless processing is 
involved with “chip rates” of 1.2288 MHz 
and 3.84 MHz. This provides the time to 
allow the performance and memory of 
Virtex devices to process at least 32 chan- 
nels sequentially, in just the same way as 
our simple fruit counter. 


Sequential 3-D Design 
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Figure 11 — 3-D design yields compelling results. 
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Final Considerations 

The Spartan-II XC2S15 has only 86 user 
I/Os, and our design has high I/O 
demands. Having used 64 for sensor 
inputs and applied a clock, only 21 I/Os 
are left for the microcontroller interface. 
Given an 8-bit data bus, it is possible to 
connect to the microcontroller, but it 
does illustrate how I/O can limit a design 
once these highly efficient techniques are 
employed. 

Of course, it would be a pity for 75% of 
the XC2S15 to be completely wasted. It 
would be nice to embed the microcon- 
troller and the UART in the same device. 
This is also possible, but it’s a topic for 
another article. 

Meanwhile, once you discover that 3-D 
designs are possible, you are well on your 
way to improving the profit margins on 
your own designs. 


[Editor’s note: This article was derived 
from a two-part TechXclusive on the 
support.xilinx.com website. To see the 
original TechXclusive, go to 
support.xilinx.com/support/ 
techxclusives/3-D-techX22.htm and 


support.xilinx.coml/support/ 
techxclusives/3-D2-techX23.htm. 

To see more TechXclusives, 
go to support.xilinx.com and search for 
“TechXclusives,” then click on “Xilinx 


TechXclusives Home.” | & 
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How to Implement a 


DDR SDRAM Controller 


The DDR SDRAM specification can make designing a controller tricky, 
but using an innovative approach with Xilinx Virtex-II devices lets you 
avoid the time and-costs associated with ASIC implementations, 


by Tim Murphy 

Senior Design Engineer 
Plexus 
tim.murphy@plexus.com 


Our customer's application required 
buffering a large amount of data, which 
prevented us from using internal memory 
and also made external SRAM cost pro- 
hibitive. Product volumes were too low to 
justify spending the non-recurring engi- 
neering costs associated with an ASIC, 
especially for initial prototypes. The data 
transfer rate through the buffer ruled out a 
single data rate SDRAM. We needed a 
double data rate (DDR) SDRAM con- 
troller running at 200 MHz. 

Several built-in Virtex™-II features 
enable a design at DDR SDRAM clock 
rates. Although implementing the con- 
troller was still a challenge, we found the 
ideal solution — from both a cost and band- 
width perspective — was a DDR SDRAM 
with the controller implemented in a 


Virtex-II XC2V2000-5FG676 device. 


DDR SDRAM Basics 

DDR SDRAM is very similar to single data 
rate SDRAM, except that data is clocked 
into or out of the device on both the rising 
and falling edges of the clock. All control 


signals, however, still change only on the 
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rising clock edge. During any data access, 
the controller provides the DDR SDRAM 
with a clock, inverted clock, address, and 
control signals. During a write cycle, the 
controller also provides data and a data 
strobe signal (DQS). During a read cycle, 
the DDR SDRAM provides data and the 
DQS signal. 

The DQS signal is used to clock data 
into the DDR SDRAM on a write and 
into the controller on a read. Bank pre- 
charging, refreshes, and so forth are han- 
dled in a DDR SDRAM controller in 
much the same way they are handled in a 


standard SDRAM. 


Design Challenges 
Keys to a successful DDR SDRAM con- 
troller design include a thorough under- 
standing of the clocking specifications and 
the clever use of the Virtex-II digital clock 
manager (DCM). 


Clocking 

The DDR SDRAM specification requires 
that the clock and inverted clock received 
from the controller cross within a very 
tight window, as shown in Figure 1. The 
crossing point of these clocks is considered 
the clock edge in the DDR SDRAM spec- 
ification. To ensure our design met this 
window, we used an LVDS signal pair 





configured using the SSTL signaling stan- 
dard. Even though LVDS wasn’t used, the 
fact that this pair of outputs could be used 
to drive 622 Mbps differential data 
ensured that the skew between the outputs 
was small enough to meet the DDR 
SDRAM requirement. 


Data Capture 

As mentioned earlier, data is clocked into 
and out of the DDR SDRAM using the 
DQS signal. The DQS signal is bidirec- 
tional. During a write, it’s driven by the 
controller. To maximize setup and hold 
time windows, the controller must drive 
DQS 90° out of phase with the data. Data 
is clocked by the DDR SDRAM on both 
edges of DQS. 

During a read, the DDR SDRAM pro- 
vides both data and DQS. However, the 
DDR SDRAM provides data and DQS 
coincident with each other. This means the 
controller must either provide the 90° 
phase shift internally or find another way 
to clock in data. In addition, DQS is a 
strobed signal. It is driven while there is a 
transaction in progress, but tri-stated oth- 
erwise. These two items complicate the 
controller design a bit. 

Because DQS is strobed, a DCM can’t 
be used to provide the shift because the 
DCM needs a continuously running clock. 
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The DQS net could potentially be delayed 
inside the FPGA, but there is no way to 
specify a minimum delay to the tools. 
FPGA Editor could be used to add delay by 
hand, but relying on a minimum delay is 
bad design practice. The DQS net could be 
delayed on the board. This is a bit more 
straightforward. Assuming the PC board is 
made of FR4 material, the net will see a 
delay of ~150 ps/inch. This helped our read 
timing, but unfortunately hurt our write 
timing by the same amount. 


The Solution 

Our design solved the problem by not 
using DQS at all — except as a data enable. 
Instead, we started with the internal clock 
sent to the DDR SDRAM (referred to as 
the write clock) and ran it through a 
Virtex-II DCM. The DCM phase-shifted 
the clock by an amount carefully deter- 
mined through a detailed timing analysis of 
the system. This clock is referred to as the 
read clock. 

In addition, we treated the read and 
write clocks as asynchronous and used the 
DQS as a data enable, as shown in Figure 2. 
As a result, our design can tolerate a skew 
between the read and write clocks of any- 
where from 0 ns to 10 ns (one clock peri- 
od). Therefore, if the SDRAM timing is 
affected by a new SDRAM part or board 
change, the FPGA does not have to be 
re-done. 


Other Useful Virtex-II Features 
We also took advantage of several other 
Virtex-II features in our design: 


¢ The Virtex-II has DDR registers built 
into the input/output blocks (IOBs). 
This feature allowed us to isolate the 
200 MHz DDR signaling in the IOBs, 
and allowed the rest of our design to 
run at 100 MHz. 


¢ XCITE — Xilinx digitally controlled 
impedance (DCI) technology allowed 
us to terminate the DDR SDRAM 
signals without using any external 
resistors. 


¢ The DCMs allowed us to synthesize 
clocks, limit jitter, and reduce clock 
skews. 
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This is considered 
the clock edge. 
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cross in this region. 
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Figure 1 — Clock crossing 
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Process 

The architecture for the entire design was 
created before any RTL code was written. 
This practice allowed us to explore various 
alternatives quickly and select the best one. 
When we started our RTL coding, we ran 
into very few problems. 

A DDR SDRAM has many corner 
cases. It is extremely difficult to find prob- 
lems in hardware. However, using Mentor 
Graphics’ ModelSim™ simulator, Xilinx 
UNISIM libraries, and Microns DDR 
SDRAM memory model, we successfully 
simulated and debugged all corner cases. 

DDR SDRAM corner cases can be diffi- 
cult to control because they are independent 
of the design’s intent. Here’s one example: A 
test writes a series of words into the 


SDRAM, but the SDRAM needs a refresh 


: Read Active Read Active F 
Data Valid feel oisetD Read Active ass 
Data Valid B Logic an Ee |Read Active j 


cr 


System Clock 
Domain 


ll 


Gar 


Upstream Logic 


Figure 2 — Synchronization scheme 





in the middle of the burst. These corner 
cases are very hard to target. Generating ran- 
dom tests and running code coverage helped 
determine which corner cases were hit and 
which needed more work. 


Conclusion 

Designing a DDR SDRAM controller 
can be tricky. By utilizing the Virtex-II 
features along with a solid process and a 
little creativity, however, we were able to 
implement a 200 MHz DDR SDRAM 
controller in a —5 speed grade Virtex-II 
FPGA. 

To design this controller in an ASIC 
would have cost a great deal more, and it 
could not have been possible in previous 
FPGA architectures. Visit 
www.plexus.com for more information. & 
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Is Your FPGA 
Design Secure? 
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After spending months on 
your design, the last thing 
you want is to find your 
design has been stolen. 
Say goodbye to “locks,” 
“fuses,” “antifuses,” and 
other contraptions. You can 
sleep peacefully when you 
design with Xilinx. 


by Anil Telikepalli 

Marketing Manager, Virtex Solutions 
Xilinx, Inc. 

anil. telikepalli@xilinx.com 


With an increased focus on security and 
terrorism in the world, it is natural for you 
(and me) to worry about the security of our 
designs and products. Are your ICs really 
foolproof? Can someone steal your design 
inside the chip? What about cloning? 
Reverse engineering? 

Here at Xilinx, we want you to rest at 
ease. All Xilinx devices (FPGAs and 
CPLDs) have robust security mechanisms 
that make it nearly impossible to steal 
designs. In particular, Virtex-II Pro™ 
Platform FPGAs have advanced Triple 
DES (Data Encryption Standard) security 
embedded in them. We are not talking 
about my grandmother's “secure” pepper- 
mint box with its heavy lock that you or I 
could crack open with a toothpick. Triple 
DES algorithms provide the same security 
that the world’s financial institutions rely 
on every day for transactions involving tril- 
lions of dollars. With security mechanisms 
you can “bank” on, let’s get into the details 
of design security and how Xilinx protects 
your valuable proprietary designs. 
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Design Security 
Consider the philosophy of theft. There is 
nothing such as absolute security in this 
world. A determined thief can break any 
barrier if given enough time and resources. 
Our goal is to make it extremely diffi- 
cult to break that barrier. Hence, we've 
raised that barrier to the state of the art. 
Breaking the Xilinx barrier is virtually 
impossible for the vast majority of pirates. 
To help you understand just how we do 
this, let’s look at the three basic levels of 
attack on a security barrier. 


Categories of Security Attacks 

IBM™ defined three categories of security 
attacks in a paper published in 1991. 
Although the paper is somewhat dated, it 
still serves as a pretty good reference 
(Abraham, D., G. Dolan, G. Double, and 
J. Stevens, 1991. Transaction Security Sys- 
tem. JBM Systems Journal 30(2): 206-229.). 


1. Class I: This is a clever outsider 
and a curious person with negligible 
resources. Representative of the 
majority of the population of hackers, 
he is not interested in wholesale pira- 
cy. He simply wants a capability for 
personal use. Seemingly harmless, this 
class of thief can pose a significant, 
worldwide threat if he shares the ill- 


gotten information on the Internet. 


2. Class II: This person is a knowledge- 
able insider with access to some 
sophisticated resources. Examples 
include university students and 
unscrupulous corporate employees 
with access to the Internet. However, 
the information provided by this class 
tends to be esoteric and only usable 
by the same class or higher. 


3. Class II]: This is a funded organization 
with a determined team of experts — 
people who can crack open literally 
anything. Examples include the FBI, 
CIA, NSA, and any other large com- 
mercial or governmental operation 
with unlimited funds for wholesale 


reverse engineering operations. 


Class I and Class II attacks can be 
deterred; a good IC helps the designer 
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achieve this objective. Xilinx Virtex-II Pro 
FPGAs provide a redundant key structure 
to thwart even most Class II] attacks. 


Programmable Device Security 

There are two major types of design theft — 
cloning and reverse engineering. Cloning is 
copying the design as-is. Reverse engineer- 
ing is more sophisticated stealing, where 
the thief extracts the design implemented 
in a device and then improves, modifies, 
and disguises it so that it no longer looks 
like the original implementation. 

ASICs and ASSPs implement a fixed 
function and are vulnerable to attack, 
because every via/connection represents 
real logic. An attacker can de-cap such 





Figure 1 — An attacker can extract nonvolatile device die from 
a chip package and find security bits using thermal imaging. 


chips to observe the vias and hack the 
design. Sometimes bogus logic is includ- 
ed to confuse the attacker, but the chips 
are still at risk. In general, the more fixed 
an IC is, the more the risk is. Plus, in a 
fixed IC, if you suspect that the security 
of one chip is compromised, then all the 
chips are compromised. With program- 
mable ICs, however, if one chip is com- 
promised, all the other chips can be 
reprogrammed. 

Programmable devices (FPGAs and 
CPLDs) come in two types — nonvolatile 
(antifuse/flash) (SRAM) 


devices. There are many differences between 


and volatile 


them, each with unique advantages. 

For our discussion on security, we will 
only focus on the security of the program- 
ming/configuration bits. Nonvolatile 
devices retain the configuration bits and do 
not need to load the bitstream from an 
external PROM every time they are pow- 


ered up. In the past, the external PROM 


was a cause for security concerns and non- 
volatile devices had a slightly upper hand. 
But that was before the debut of the 
industry’s first secure SRAM FPGAs in 
2000 — the Xilinx Virtex™-II Platform 
FPGAs with Triple DES encryption. 


Nonvolatile Programmable Devices 

Let’s take a look at today’s nonvolatile pro- 
grammable logic devices (PLDs) that include 
both FPGAs and CPLDs from competing 
vendors. These devices provide a test mode 
to observe the internal circuit to improve 
yields as well as a read-back mode for cus- 
tomers to inspect the programmed bits. 

Both these modes expose the devices to 
attacks. To prevent such attacks, fixed secu- 
rity bits are built with antifuse 
or flash gates to prevent 
read/write from the device after 
it is ready. Here is the weakness: 
An attacker can implement a 
simple circuit in two identical 
devices — one secured and the 
other not. The security bits 
have fixed locations and can be 
distinguished by observing both 
the devices using thermal imag- 
ing (Figure 1). Once distin- 
guished, the security bits can be 
disabled with a high-intensity light source 
impinged on the die to allow read-back. 
Any circuit implemented in that particular 
device is now available to the thief. 

As device size increases in programma- 
ble devices, the configuration bitstream size 
increases. When key size also increases with 
the device size, it indicates a fixed location 
pattern of security bits. We can conclude 
that if one such device is cracked, the secu- 
rity of devices of all densities is cracked. 
Sounds scary? You bet. 

Xilinx CoolRunner™-II CPLDs use 
flash technology, but they are different 
from other nonvolatile devices. 
CoolRunner-II CPLDs add multiple layers 
of security, with read/write-protect security 
bits that are hard to find. The bits are 
placed among the programming bits deep 
under several layers of metal. 

The security bits are also placed in 
such a way that requires a specific 
sequencing of signals to set and clear 
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Figure 2 — Several metal layers in CoolRunner-II 
CPLDs prevent the theft of your design. 


them, as well as charge pumping. In addi- 
tion, four or five layers of metal (Figure 2) 
rule out direct exposure of top die to 
laser/electrical tampering. 

Two CoolRunner-I] CPLD technolo- 
gies —- DualEDGE and DataGATE — are 
schemes that also confuse attackers with 
double data operation and input signal 
locking under internal macrocell control. 
Furthermore, the legendary low power 
operation of CoolRunner-II CPLDs 
makes it difficult to see anything under 
thermal imaging. For more information 
on CoolRunner-II security, see the 
white paper “CoolRunner-II CPLDs in 
Secure Applications” at www.xilinx.com/ 


publications/whitepapers/wp_pdflwp170.pdf- 


Volatile Programmable Devices 

Volatile SRAM FPGAs lose the configura- 
tion bitstream whenever the power goes 
off, and thus they need an external memo- 
ry to hold the bitstream. A typical applica- 
tion includes an external PROM next to 
the FPGA. The bitstream between the 
PROM and the FPGA is a cause for securi- 
ty concern for some designers. 

To alleviate these concerns, Xilinx 
embedded Triple DES encryption technol- 
ogy into Virtex-II FPGAs three years ago. 

The second generation of this technol- 
ogy is now embedded in the latest Virtex- 
Il Pro FPGAs. With this scheme, the 
PROM contains the encrypted bitstream, 
which is decrypted inside the FPGA. No 
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read-back is allowed. Any form of attack 
on the FPGA erases the design. The best 
an attacker can do is to intercept the 
design between the PROM and the FPGA 
— but all he will obtain is the triple DES 
encrypted bitstream. 


Security You Can Bank On 

Triple DES is the pre-eminent encryption 
standard that is used by financial institu- 
tions for millions of transactions every day 
involving trillions of dollars. 

This same technology is used to provide 
security for your designs in Xilinx FPGAs. 
Although there have been reports of the 
older DES standard being cracked, Triple 
DES is yet to be cracked. 

Triple DES is an official NIST 
(National Institute of Standards and 
Technology) and American National 
Standards Institute (ANSI) X9.52. stan- 
dard. AES 


Standard) is the next generation of encryp- 


(Advanced Encryption 


tion that will be adopted by the industry if 
and when Triple DES is 
Encryption gurus are already working on 


cracked. 


developing future versions of AES in a con- 
stant effort to keep hackers at bay. With 
trillions of dollars at stake, millions of 
financial transactions, and perhaps thou- 
sands of hackers at work on various types of 
attacks, Triple DES has held its ground to 
date. This is why Xilinx chose Triple DES 
as the best encryption capability to use in 
its flagship Virtex-II Pro product line. 





Triple DES is a symmetric encryption 
algorithm, which means the keys used to 
encrypt and decrypt are the same. The 
security of the data lies in the key — in con- 
trast to public key systems such as RSA or 
PGP. Triple DES uses three keys and the 
encryption algorithm is repeated for each 
key for added security. Each key is 56 bits 
wide and encrypts 64-bit blocks at a time. 
For more information on the Triple DES 
standard, go to: www.itl.nist.gov/. 


Triple DES in Virtex-II Pro FPGAs 
Virtex-II Pro devices have an on-chip 
decryption engine that can be enabled to 
secure the configuration bitstream, and 
hence the FPGA. You can encrypt the bit- 
stream in the Xilinx software with a set of 
keys, and the Virtex-II Pro device decrypts 
the incoming bitstream internally using the 
same set of keys (Figure 3). 

Once the design is placed and routed in 
ISE tools, the encrypted configuration bit- 
stream is generated using the ISE’s BitGen 
program with user-selected keys (Figure 4). 
The same keys are loaded into the FPGA 
through the JTAG port using the ISE 
iMPACT tools (Figure 5). 

To program the keys into the device, 
the device has to be put into a key access 
mode, which automatically erases the 
FPGA -— including any old keys and the 
actual bitstream. This also provides protec- 
tion from attacks. 

After the device is programmed with 
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Figure 3 — Design security in Virtex-II Pro FPGAs using Triple DES encryption/decryption 


keys, it can be configured with the encrypt- 
ed bitstream. Once an encrypted bitstream 
has been programmed into the FPGA, it 
cannot be reconfigured, or partially config- 
ured, or read-back by unintended or 
intended tampering or snooping. Any 
attempt to steal a design automatically eras- 
es the FPGA completely. (Note that non- 
encrypted bitstreams may be programmed 
into an FPGA loaded with keys to facilitate 
test and debug.) 

The keys reside in a special Triple DES 
block at the corner of the device. An exter- 
nal battery is used to hold the keys when 
the board is powered off. Any standard bat- 
tery between 1V and 4V can be used, yield- 
ing a 15-year life. 

Although nonvolatile PLD security 
bits disable read/write into the device, 
Virtex-II Pro Triple DES provides real 
security using encryption. Even though 


the location of the Triple DES block 
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in Virtex-II Pro FPGAs is known, the 
keys cannot be observed without erasing 
the chip. 

Nine layers of metal pose yet another 
barrier to would-be thieves. Any attempt to 
use thermal 
imaging to find 
the security bits 
buried inside 
Virtex-II_ Pro 
FPGAs is des- 
tined to failure, 
because there 
isn’t even hard 
wiring involved 
to store the security bits. 

Each Virtex-II Pro device provides six 
separate keys, thus allowing two sets of 
Triple DES keys. You can program the 
devices up-front with both sets of keys. 
This provides security even against Class 
HI attacks. For example, if the first key set 


Translate & 
ISE design tools flow WET o) 


(FPGA Flow) 
Floorplanner 
dC 7AW =olivele 


Timing 
Analyzer 


Configuration 
BitGen 


is compromised, you can remotely repro- 
gram devices in the field with a bitstream 
using the second set of keys. 

In addition, some of our customers use 
this technique for new business strategies 
and pricing models — one key set bitstream 
enables a low-price, low-feature service, and 
the other set activates a high-price, high- 
feature service. 


Export Considerations 

Encryption standards are tightly con- 
trolled in the U.S. by the Department of 
Commerce. The decryptor on the chip 
can only decrypt the incoming bitstream 
and is not available as a design block; 
hence, Virtex-II Pro FPGAs have been 
classified as field programmable logic 





Figure 5 — Securing the device: 
Select the keys in ISE iMPACT tools. 


devices (3A001.a.7), the same as any other 
FPGA. The software has been classified 
under ECCN# 5D002 and can be export- 
ed globally — with the exception of coun- 
tries banned by the Department of 
Commerce. The bottom line is that no 
changes to your current export practices 
are necessary to include Virtex-II Pro 
FPGAs in your system. 


Conclusion 

Xilinx FPGAs and CPLDs provide unpar- 
alleled security for your designs, helping 
you achieve your design goals while main- 
taining the highest design security you 
can get. 

This dedication to the protection of 
your designs is one of the reasons why 
Xilinx has become the largest vendor of 
programmable logic devices in the world. 
We offer the highest performance, lowest 
cost solutions with maximum design secu- 
rity so you can rest easy. &£ 
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Virtex FPGAs Tame DSP-Hungry 
Broadband Wireless Designs 


Second-generation Broadband Fixed Wireless Access standards are stil 
coalescing — but all require high-performance signal processing power. 
You can meet your processing goals while maintaining standards 
flexibility by using Virtex FPGAs. 





by David Nicklin 

Senior Manager, Strategic Solutions Marketing 
Xilinx, Inc. 

dave.nicklin@xilinx.com 


David Eden 

Systems Specialist, Europe 
Xilinx, Inc. 

david. eden@xilinx.com 


Broadband Fixed Wireless Access (BF WA) 
technology standards are consolidating 
behind the IEEE 802.16 specification in 
the U.S. and the ETSI BRAN (European 
Telecommunications Standards Institute 
Broadband Radio Access Networks) spec- 
ification in Europe. This convergence 
makes BFWA technology commercially 
attractive, particularly as a flexible, cost- 
effective, 3G (third-generation) cellular 
backhaul solution. 

Both standards take broadly similar 
approaches to solving the Quality of 
Service (QoS) and bandwidth-utilization 
limitations experienced by first-genera- 
tion BFWA services. However, both 
standards demand greater signal process- 
ing performance than conventional 
DSPs can deliver today, which leaves the 
door open for developing custom hard- 
ware to boost performance. 

The BFWA market remains difficult 
to predict, especially in the domestic and 
SME (small and medium enterprise) sec- 
tors. Some observers, for example, 
believe standards must harmonize fur- 
ther to make the technology viable. 
Therefore, developers must find ways to 
achieve hardware acceleration without 
compromising flexibility, time to mar- 
ket, or cost-effectiveness. 
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System Challenges 

The differing requirements of 3G backhaul 
and residential/SME applications mean 
that, as a designer, you must create access 
solutions that are capable of handling a 
wide range of services. These include legacy 
services such as time division multiplexing 
(TDM), IP (Internet Protocol), and VoIP 
(Voice over Internet Protocol). A variety of 
backhaul requirements must also be accom- 
modated, including ATM and packet-based 
protocols. In other words, your design must 
have enough flexibility to efficiently carry 
any traffic type. 

Maintaining QoS and deliver- 
ing 99.999% availability, or “five 
9s,” is made more difficult by 
environmental effects such as 
rain, obstructions, or other non- 
line-of-sight conditions. Both 
the IEEE and ETSI standards 
have introduced adaptive coding 
and modulation algorithms that 
make the best use of the available 
bandwidth under favorable link 
conditions, and they invoke a 
more reliable alternative to main- 
tain availability under unfavor- 
able link conditions. Co-channel 
interference also acts to degrade 
link quality, and both standards 
address this issue as well. 

A wireless access system may 
be presented with multiple con- 
nections per terminal, multiple QoS levels 
per terminal, and a large number of statisti- 
cally multiplexed users. As a result, second- 
generation systems demand an extremely 
high-performance, scalable, signal process- 
ing platform designed for wireless processing 
and dataflow. 


Physical Layer Design 

While the Media Access Control (MAC) 
layer handles algorithms relevant to the 
various traffic classes, complex adaptive 
coding and modulation are performed at 
the PHY (physical) layer. 

The PHY specifications for [EEE 
802.16 and ETSI BRAN standards estab- 
lish a burst specification that allows both 
time-division duplexing (TDD) and 
frequency-division duplexing (FDD). Both 
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TDD and FDD support adaptive burst 
profiles in which transmission parameters 
relevant to modulation and coding may be 
assigned dynamically on a burst-by-burst 
basis. Further complexity is added by 
including support for half-duplex FDD 
subscriber stations (which may help to 
reduce the cost of subscriber equipment 
because they do not simultaneously trans- 
mit and receive). 

Variable burst profiles require extensive 
processing resources, including Reed- 
Solomon forward error correction (FEC) 
with variable block size and error correc- 





tion capabilities. Moreover, FEC is com- 


bined with 16-state quadrature amplitude 
modulation (16-QAM) or 64-QAM in 
both ETSI BRAN and IEEE 802.16. ETSI 
BRAN also defines 4-QAM. 

A PHY implementation for either stan- 
dard integrates transceiver and CODEC 
(coder/decoder) functions as well as back- 
plane interfaces and data queuing. This 
implementation calls for complex mixers, 
frequency synthesizers, mapping capabili- 
ty, automatic gain control, and access 
point synchronization in the transceiver. 
Additional CODEC blocks 
Viterbi and convolutional coding; inter- 
face buffering; PDU header controls and 
scramble/descramble; and MUX/DEMUX 


(multiplexer/demultiplexer) 


include: 


functions. 


Additional interfacing and data-queuing 


functions include frame formatting, cell 
queuing, and lookup or PDU overhead 
formatting. 

As a result, ETSI- or IEEE-compliant 
base stations must integrate extensive pro- 
cessing capabilities, many of which you can 
perform in hardware. 


Hardware Acceleration 

In both second-generation BFWA stan- 
dards, the algorithmic complexity of the 
PHY layer in particular has accelerated 
faster than Moore’s law can empower DSPs 
to keep pace. Still, a software-reconfigurable 
solution is desirable given the 
current uncertain market con- 
ditions. Market projections are 
under constant review, and 
some manufacturers believe 
further harmonization of stan- 
dards must take place before 
the true potential of BFWA will 
be realized. 

All of these factors make it 
extremely difficult to plan an 
ASIC development with confi- 
dence. These factors also make 
creating the ICs for standards- 
compliant equipment a signifi- 
cant challenge, especially if you 
want to avoid the fixed engi- 
neering development costs. 

On the other hand, high- 
speed FPGAs allow you to 
create custom hardware and exploit the 
massive parallelism needed to meet per- 
formance goals without sacrificing flexi- 
bility. You can also achieve high 
integration by implementing DSP func- 
tions alongside protocol translation, glue 
logic, and other system functions. The 
Virtex™-II and Spartan™ FPGA fami- 
lies also provide a wealth of processing 
resources, including Xilinx MicroBlaze™ 
soft processor cores, as many as four 
embedded IBM PowerPC™ cores (in 
Virtex-I] Pro™ Platform FPGAs), and 
user-selectable I/Os. 

These resources combine well with large 
IP (intellectual property) libraries, through 
which Xilinx offers functions such as 
Viterbi, turbo-product coding, and other 
off-the-shelf functions. 
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FPGA Implementation 
Figure 1 shows a block diagram 
for an ETSI-compliant BRAN 


Platform FPGA 
PHY layer. You can create the 
FEC Decoder 
Descrambler 
Decryption 





design represented here with 
either two Virtex-II XC2V3000 
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Bus 
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lapping 


Utopia 
Slave I/F 
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FPGAs or one XC2V6000. 


Transceiver Design 

The access point transceiver 
shown in the diagram includes a 
Hilbert Transform decimator and 
interpolator and complex mixers 
running at around 100 Mbps, or 
double the symbol rate. You can 
implement these using the Xilinx 
System Generator tool, exploiting 
block RAM and embedded multi- 
pliers available in the Virtex archi- 
tecture as well as its extensive logic 
resources and look-up tables 
(LUTs). The mapper function 
uses block RAM, multipliers, and 
LUTs to perform LUT-based 
mapping. 

Additional major functional blocks 
include complex variable decimation and 
interpolation (CVD/I), linear interpolated 
digital frequency synthesis, and access 
point synchronization. You can easily build 
all of these functions using the 
XC2V3000/6000’s plentiful block RAM, 
multiplier, and LUT functions. 


CODEC 

When designing the CODEC, the com- 
bination of Virtex high-speed processing 
performance and off-the-shelf IP is espe- 
cially powerful. You can implement a 
standard Viterbi decoder directly in the 
Virtex-II architecture using approximate- 
ly 1,000 slices and two block RAMs. 
Supporting OC3 (Optical Carrier 3 — 
155 Mbps) data rates and higher, the 
Viterbi decoder implementation offers 
fully synchronous two-clock or one-clock 
versions to reduce latency, size, and power 
dissipation. Because the system-code rate 
for ETSI BRAN and IEEE 802.16 varies, 
the decoder also allows you to change 
puncturing on the fly by using control 
bits generated from data formatting. 
The decoder is available as VHDL source 
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Figure 1 — Access-point transceiver for ETSI-compliant BRAN PHY layer 


code or as a fixed netlist from the Xilinx 
LogiCORE™ IP program. 

Both BFWA standards call for Reed- 
Solomon forward error correction with 
variable block size and error correction 
capabilities. We have modified the standard 
Xilinx Reed-Solomon IP core to meet the 
required frame duration and symbol rate, 
and to reduce latency below 92 byte peri- 
ods. You can also implement convolutional 
decoding and encoding using standard 
Xilinx IP cores. The ETSI BRAN specifica- 
tion includes headroom for optional Turbo 
product codes, which are also readily avail- 
able as complete IP cores from Xilinx. 


At the Backplane 

The Virtex-I] XC2V3000/6000 devices can 
also implement backplane interface and data 
queuing functions by using the standard 
Utopia2 (Universal Test and Operations 
PHY Interface for ATM Level 2) backplane 
interface IP core. Furthermore, you can 
custom-define a state machine to perform 
functions such as scheduling and timing 
in about 200 © slices of the 
XC2V3000/6000 devices. With all of this 
functionality, you still have plenty of 


block RAM and logic resources for queu- 
ing, overhead formatting, and header 
processing. 


Conclusion 

New-generation standards for broadband 
fixed wireless access have helped make the 
technology commercially attractive. But 
considerable uncertainty remains regard- 
ing future standards and markets. This 
uncertainty demands flexible, adaptable 
solutions. 

At the same time, to answer the QoS and 
availability limitations of first-generation 
systems, second-generation standards 
require processing power beyond the capa- 
bilities of conventional reconfigurable DSPs. 

Xilinx FPGAs deliver the flexibility and 
raw DSP performance you need as well as 
the extensive IP cores and engineering sup- 
port to solve these diverse challenges. 


For more information, visit these websites: 


¢ Xilinx Wireless Networks Solutions — 
www.xilinx.coml/esp/wireless_networks/ 
index.htm 


¢ IP cores suitable for BFWA — 
www.xilinx.comlipcenter/. & 
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A unique, scientific research instrument and 
art piece, the BioWall models bio-inspired 
electronic tissues capable of evolution, self 
repair, self-replication — and learning. 

Much pure and applied scientific 
research has focused on replicating biologi- 
cal functions in digital hardware. Here, at 
the Logic Systems Laboratory of the Swiss 
Federal Institute of Technology in Lausanne 
(EPFL), we have utilized 5,700 Xilinx 
Spartan™ FPGAs, in multiple configura- 
tions, to build bio-inspired computing 
machines that exploit three essential biolog- 
ical models: 


¢ Phylogenesis — the history of the 
evolution of the species 


¢ Ontogenesis — the development of 
an individual as directed by his 
genetic code 


¢ Epigenesis — the development of an 
individual through learning processes 
(nervous system, immune system), 
influenced both by genetic code 
(the innate) and environment 


(the acquired). 


Although we have individually and jointly 
investigated all three models, we have concen- 
trated on the ontogenetic model through the 
Embryonics (embryonic electronics) Project. 
This project studies the development of 
multi-cellular organisms for the purpose of 
obtaining in digital hardware some of the fea- 
tures of biological organisms, notably growth 
and fault tolerance. 

Our work has attracted a flattering 
amount of interest in the most varied and 
sometimes unexpected milieus. Among the 
most unexpected sources of funding and 
support came from Mrs. Jacqueline Reuge. 
Mrs. Reuge decided to fund the construc- 
tion of the BioWall to display the principles 
of embryonics within a museum built to 
honor the memory of her late husband. 

Her generous support has allowed us 
to maintain our tradition of verifying our 
theoretical concepts in hardware. Without 
Mrs. Reuge’s support, we could not have 
constructed a computing machine of such 
magnificent proportions. 

We named this machine Bio Wall because 
of its biological inspiration, as well as its size 
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Figure 1 — The BioWall reacts to touch. 


(5.3m x 0.6m x 0.5m = 3.68m°, or 130 
cubic feet). The main purpose of creating 
the BioWall is to demonstrate the features 
of our embryonics systems to the public 
through a visual and tactile interaction 
(Figure 1). 


Transparent 
touch-sensitive 
| element 





Two-color 
LED display 


Interconnections with neighbor cells 


Figure 2 — The BioWall’ basic building block 


Bio-Inspired Machines 
We implemented — for the first time in 
actual hardware — an organism endowed 
with all of the features of an embryonics 
machine, as it has often been defined in the 
literature. One of the functions of our 
organism is the BioWatch, which counts 
hours, minutes, and seconds. It demon- 
strates the growth and self-repair capabili- 
ties of our systems. 

The implementation of the BioWatch 
would have been sufficient to justify the 





effort that went into the construction of 
our embryonic BioWall. However, in 
developing our machine, we quickly real- 
ized that the capabilities of such a plat- 
form were not limited to a single 
application. In fact, it is an ideal platform 
to prototype many different kinds of two- 
dimensional cellular systems, which are 
systems comprising arrays of small, locally 
connected elements. 

For example, “cellular automata” (CA) 
are very common environments in bio- 
inspired research. The BioWall is ideally 
suited to the implementation of CAs, but it 
is by no means limited to it. We have only 
begun to explore the possibilities of the 
BioWall as a research tool. 


Xilinx Behind the Scenes 

We built the BioWall to demonstrate an 
embryonic machine. The structure of such 
machines is hierarchical: Organisms (appli- 
cation-specific systems) are created by the 
parallel operation of a number of cells 
(small processors). Each cell is implement- 
ed as an array of molecules (programmable 
logic elements). 

To implement this kind of machine, the 
BioWall is structured as a two-dimensional 
tissue comprising units (each unit corre- 
sponds to a molecule). As shown in Figure 2, 
each unit consists of: 


e An input element (a touch-sensitive 


membrane) 


¢ An output element (an array of 64 
two-color LEDs) 


e A programmable computing element 


(a Xilinx Spartan XCS10XL FPGA). 
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Figure 3 — Front and back view of 25 FPGAs assembled on a BioWall panel 


The circuits are mounted on double 
boards. The logic board hosts 25 FPGAs, 
and the display board holds the displays 
and membranes (Figure 3). The two 
boards are rigidly bound together and 
connected by a bus to allow two-way com- 
munication between the logic and the dis- 
play (a dedicated circuit on the logic board 
automatically distributes the signals to and 
from the displays). 

On the logic board, the Spartan devices are 
placed in a regular two-dimensional grid. A 
subset of the pins of each FPGA (approxi- 
mately 20 per side) are used to make a direct 
pin-to-pin connection between each circuit 
and its four cardinal neighbors. The pins of 
the FPGAs placed along the edges of the 
board are brought to a set of connectors to 
allow the pin-to-pin association to continue 
across boards (Figure 4), thus creating per- 
fectly uniform surfaces of FPGAs spanning as 
many boards as required. 

The remaining pins are connected to a 
centralized circuit that handles the distribu- 
tion of the global signals (the clocks, resets, 
and FPGA configurations) arriving from 
the outside. 

We have built 228 such boards (not 
including spare materials) for a total of 
5,700 units. The architecture of the boards 
implies that they can be seamlessly connect- 
ed with each other to form a uniform sur- 
face of any shape and size. Throughout the 
development phase and lifetime of the 
machine, we have so far constructed several 
independent machines: 


¢ A 3,200-unit machine (Figure 5) 
displayed at the Villa Reuge museum 


¢ A 2,000-unit machine kept in our labora- 
tory to develop and test new applications 
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Figure 4 — Connectors allow pin-to-pin 
association to join multiple boards. 


¢ A 150-unit machine embedded 
together with the necessary control 
logic to charge applications into a 
suitcase for portability 


¢ A 4,000-unit machine that will be on 
display at the Telecom ’03 conference 
in Geneva in October 2003. 


This tissue of 5,700 FPGAs represents 
an impressive amount of computational 
power, coupled with I/O interfaces (com- 
prising the membranes and LED arrays) 
that allow for large-scale visual and tactile 
interaction. The advantage of this solution 
is the size of the display, which enables an 
immediate interaction with applications 
normally limited to software simulation on 
a computer screen. Furthermore, the com- 
puting power and programmability of the 
Xilinx FPGAs enables the prototyping of 
new bio-inspired systems. 

In the current version of the BioWall, 


the Xilinx FPGAs can only be programmed 
with the same configuration, which limits 
the functionality of the units to the 10,000 
equivalent logic gates of Spartan devices. 
The considerable delays inherent in propa- 
gating a global signal over distances meas- 
ured in meters seriously limit the clock 
speed. Considering the role of the BioWall 
as a demonstration tool, we have not tried 
to push the clock to its limits, as the cur- 
rent frequency of 1 MHz is more than ade- 
quate if coupled with the massive 
parallelism of the machine. 

Besides the I/O capabilities of the mem- 
branes and LED displays, a set of modules 
placed on the borders of the machine 
allows the tissue to be interfaced with stan- 
dard logic either via a PC or directly with 
user-defined modules. The modules allow 
access only to the borders of the array, but, 
if necessary, signal propagation logic can be 
programmed in the FPGAs. 

The software tools developed for the 
BioWall are rudimentary but complete. A 
simple interface on a PC allows users to 
define a set of files to configure the tissue. 
Four kinds of files are currently defined: 
the configuration file for the Xilinx 
FPGAs, and three different formats used to 
send user-defined data on the input pins at 
the borders of the tissue (used, for example, 
to provide an initial configuration for a cel- 
lular automation). The values on the out- 
put pins at the borders of the tissue can be 
read by the PC and either stored on disk or 
used as required. 


Applications 

The cellular structure of the BioWall is well 
suited to the implementation of all sorts of 
bio-inspired applications. The BioWall can 
exploit the versatility inherent in its pro- 
grammable logic and in its architecture to 
implement hardware inspired by all the 
three models of biological inspiration: phy- 
logenesis, ontogenesis, and epigenesis. 


BioWatch 

To illustrate the implementation of the 
BioWatch application on the BioWall, we 
will introduce a slightly simplified exam- 
ple. Whereas the complete BioWatch is an 
organism capable of counting hours, min- 
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Figure 5 — The BioWall on display in the Villa Reuge Museum 


utes, and seconds, the “counter applica- 
tion” we describe here only counts seconds. 
Otherwise, the principles of operation of 
the two machines are identical. 

The counter counts seconds, from 00 to 
59. From left to right, the display shows 
tens of seconds (from 0 to 5), units of sec- 
onds (0 to 9), and a spare zone, which 
remains inactive during normal operation 
(Figure 6a). The counter is divided into 
four cells: two active (indicating tens and 
units, respectively) and two spare. Each 
unit of the BioWall is a molecule of the 





cell 1 cell 2 cell 3 cell 4 
tens of units of spare spare 
seconds seconds 


Figure 6a — Counter implemented on Bio Wall 
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Figure 6b — Hierarchical structure 
of the counter application 
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embryonics hierarchy. A cell is then a 
mosaic of (20 x 25) 500 molecules (Figure 
6b), and contains two repair columns for a 
total of (2 x 25) 50 molecules. 

You have control over the “life” of each 
molecule. A stuck-at-fault can be inserted 
in any molecule simply by pressing on the 
corresponding unit’s membrane. The fault 
detection mechanism included in the 
embryonics molecular layer (embedded 
into the programmable logic of the Spartan 
FPGAs) automatically detects the error and 
activates the molecular self-repair mecha- 
nism. A “dead” molecule is instantly 
replaced by the neighbor immediately to its 
right, and so on, until the nearest repair 
column (Figure 7a). 

The limits of this kind of self-repair 
imply that only a single molecule per line, 
between two repair columns, can be killed. 
If this constraint is respected, the cell sur- 
vives any amount of faults, although the 
figure displayed is distorted. Each cell can 
thus tolerate up to two faults per line (one 
fault between each pair of repair columns), 
equaling (2 x 25) 50 faults in total. 

If the above rule is not respected, and 
several faults are inserted on the same line 
of the same cell between two repair 
columns, the molecules can no longer 
and the cell dies. 
However, the death of a cell does not imply 


repair themselves 





healthy display one error on three errors on 
one line three lines 
Figure 7a — Molecular self-repair 


of the counter application 
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Figure 7b — Cellular self-repair 
of the counter application 


the death of the organism. It is instantly 
replaced by a spare cell to its right 
(Figure7b), while the dead cell is switched 
off and becomes a scar. 

Throughout this self-repair process, the 
counter continues to work without fault. 
The tissue remembers its state and recovers 
the correct time after repair. Moreover, we 
are currently implementing an “unkill” 
mechanism to address the issue of transient 
faults. If a sufficient number of faults are 
removed (by pressing the membranes of 
dead molecules), this mechanism will auto- 
matically re-activate a dead cell, which will 
recover its functionality (and its state) 
within the organism. 

The self-repair capabilities of the 
embryonics machines are based on a gener- 
al principle of life — cell differentiation. 
Each organism is a collection of cells, each 
containing a full copy of the genetic pro- 
gram, the genome. This structure makes 
the whole organism extremely robust, 
because each cell contains the complete 
plan and can therefore replace any other 
defective cell. 

Nevertheless, like all artificial and natu- 
ral organisms, the death of a sufficiently 
large number of cells cannot be repaired, 
causing the death of the organism. The 
advantage of the controlled environment in 
which the machine operates is that the 
death of the organism causes a general reset 
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of the system, the obliteration of all inject- 
ed faults, and the “birth” of a new, perfect- 
ly functioning machine. 

The complete implementation of the 
BioWatch on the BioWall uses eight cells of 
(20 x 20) 400 molecules each, with two spare 
columns of molecules in each cell. Six of the 
eight cells are active during normal opera- 
tion, while two are spares, ready to replace a 
dead cell. All the theories of the Embryonics 
Project have been tested and verified in hard- 
ware through this implementation. 


Self-Replicating Loops 

Initiated by von Neumann more than 50 
years ago, the study of self-replicating 
computing machines has produced a 
plethora of results. Much of this work is 
motivated by a desire to understand the 
fundamental information processing prin- 
ciples and algorithms involved in self- 
replication independently of their physical 
manifestation. The construction of artifi- 
cial self-replicating machines can have 
diverse applications ranging from nan- 
otechnology to space exploration to recon- 
figurable computing tissues. 

To render the self-replication process 
more interactive and visible, we implement- 
ed self-replicating loops on the BioWall, ini- 
tially of size 2 x 2 and then of variable size 
(Figure 8). In this implementation, every 
unit of the BioWall is one cell of the CA. 
Pressing on the membrane of a unit belong- 
ing to a loop causes the unit to replicate in 
one of four cardinal directions (Figure 9). 


Turing Neural Networks 
In 1948, Alan Turing wrote a little-known 
report entitled “Intelligent Machinery.” 
Turing never had great interest in publiciz- 
ing his ideas, so the paper went unpub- 
lished until 1968, 14 years after his death. 
Few people know that Turing’s 
“Intelligent Machinery” paper contains a 
fascinating investigation of different con- 
nectionist models that would today be 
called neural networks. In describing ran- 
domly connected networks of artificial 
neurons, Turing wrote one of the first man- 
ifests of the field of artificial intelligence 
(although he did not use this term). 


Recently, we implemented Turing’s 
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Figure 8 — Self-replicating loops on the BioWall 





Figure 9 — Self-replication of a set of loops 





Figure 10 — Turing neural networks on the BioWall 





Figure 11 — Two-dimensional Firefly 


neural networks on the BioWall’s reconfig- 
urable tissue (Figure 10). Each of the 
3,200 units of the machine can be interac- 
tively configured by choosing one out of 
five possible functions: 

1. Empty cell 


2. Neuron 

3. Connection 
4, Synapse 

5. Input cell. 


You can discover and affect the behavior 
of this “unorganized” machine by opening 
and closing synapses, “organizing” the 
machine and modifying the network’s 
inputs. All modifications occur by simply 
pressing on the touch-sensitive mem- 
branes. This application is first and fore- 
most a demonstration of Turing’s neural 
networks on reconfigurable hardware (and 
to the best of our knowledge, the first one). 
However, it also exemplifies the fusion of 


the ontogenetic and epigenetic models in a 
single artificial tissue. 


Firefly 

In 1997 the Logic Systems Laboratory pre- 
sented an evolving hardware system called 
Firefly, based on a cellular programming 
approach, in which parallel cellular 
machines evolve to solve computational 
tasks. The computational task studied — 
and successfully solved — is known as “syn- 
chronization”: Given any initial configura- 
tion, the nonuniform CA must reach, 
within M time steps and using only local 
connections, a final configuration in which 
all cells oscillate synchronously between all 
Os and all 1s on successive time steps. 

The novelty of Firefly is that it operates 
with no reference to an external device, 
such as a computer that carries out genetic 
operators. Thus, the Firefly demonstrates 
online autonomous evolution. 

The original Firefly machine was able to 
find a solution for a one-dimensional CA. 
Subsequently, we have been able to evolve, 
on the BioWall’s 3,200 FPGAs, a CA that 
solves the synchronization task in two 
dimensions. 

The implementation on the BioWall 
(Figure 11) consists of a two-state, nonuni- 
form CA, in which each cell (FPGA) may 
contain a different rule. The cells’ rule 
tables are encoded as a bit-string, known as 
the genome. This genome has a length of 
25 = 32 bits for our two-dimensional CA 
(the binary CA has a neighborhood of 5). 

Rather than employ a population of 
evolving CAs, our algorithm evolves a sin- 
gle, nonuniform CA the size of the entire 
BioWall (one cell of the CA in each unit 
of the BioWall, or 3,200 cells) whose rules 
are initialized at random. Initial configu- 
rations are then randomly generated and 
for each configuration the CA is run for 
M time steps. 

Each cell’s fitness is accumulated over C 
initial configurations: a single run’s score is 1 
if the cell is in the correct state after M + 4 
iterations, and 0 otherwise. The local fit- 
ness score for the synchronization task is 
assigned to each cell by considering the last 
four time steps (M + 1 to M + 4). If the 
sequence of states over these steps is pre- 
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cisely 0-1-0-1, the cell’s fitness score is 1; 
otherwise this score is 0. 

After every C configuration, the rules 
are evolved through crossover and muta- 
tion. This evolutionary process is per- 
formed in a completely local manner; that 
is, genetic operators are applied only 
between directly connected cells. 

Unlike standard genetic algorithms, 
where a population of independent prob- 
lem solutions globally evolves, our 
approach involves a grid of rules that co- 
evolves locally. The CA implemented on 
the BioWall performs computations in a 
completely local manner, each cell having 
access only to its immediate neighbors’ 
states. In addition, the evolutionary process 
is also completely local, because the appli- 
cation of genetic operators as well as the fit- 
ness assignment occurs locally. 

Using the above-described cellular pro- 
gramming approach on the BioWall, we 
have shown that a nonuniform CA of 
radius 1 can be evolved to successfully solve 
the synchronization task. In addition, after 
having found a set of successful rules, our 
machine allows the state of each CA cell to 
be changed by pressing on its membrane. 
You can then observe how the machine re- 
synchronizes the 3,200 cells. 


DNA Sequence Comparison 

The comparison and alignment of charac- 
ters taken from a finite alphabet is a funda- 
mental task in many applications, ranging 
from full-text search to computational biol- 
ogy. In particular, string comparison is a crit- 
ical issue in the field of molecular biology. 

In fact, both DNA fragments and pro- 
teins can be represented as sequences of 
characters (taken from alphabets of four 
and 20 symbols, respectively). Sequence 
similarities provide useful information 
about the functional, structural, and evolu- 
tionary relationships between the corre- 
sponding molecules. 

Biological sequences may differ because 
of local substitutions, insertions, and dele- 
tions of one or more characters. The com- 
plexity of string comparison comes from 
the large number of possible combinations 
of these three basic mutations. 

The similarity between two strings can 
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Figure 12 — The Needleman-Wunsch 
algorithms in two dimensions 





Figure 13 — The Needleman-Wunsch algorithm 
implemented on the Bio Wall 


be evaluated either in terms of edit distance 
or similarity score. The edit distance is a 
measure of the minimum number of edit 
operations (mutations) required to make 
the two strings equal to each other. 

The similarity score is a measure of the 
maximum number of residual matches 
between the two strings. Both metrics can 
be evaluated in polynomial time by means 
of dynamic programming techniques. 

The key algorithm for evaluating the 
similarity between two strings of length N 
and M was developed by Needleman and 
Wunsch and takes O (N x M) steps to com- 
plete execution. The two-dimensional 
structure of the algorithm (Figure 12) 
makes it suitable for a parallel implementa- 
tion on a systolic array. 

In particular, hardware parallelism can 
be exploited to perform string comparison 
in O (N + M) steps. In this experiment, we 
present a parallel implementation of the 
Needleman-Wunsch algorithm on the 
BioWall (Figure 13). 

The BioWall cannot compete in perform- 
ance with existing parallel implementations 
of the Needleman-Wunsch algorithm, 


because it suffers from the typical performance 
limitations of a large prototyping platform. 
Nevertheless, the implementation of the 


Needleman-Wunsch 


BioWall is a significant design experiment in 


algorithm on _ the 


the field of reconfigurable computing, because 
of the peculiarities of the target architecture. 


Conclusion 

The current configuration of the BioWall is a 
mosaic of more than 3,000 transparent elec- 
tronic modules. Each module enables visitors 
to communicate with the surface simply by 
touching it with their fingers. The BioWall 
calculates its new status and indicates it imme- 
diately on an electronic display. The usefulness 
of this approach has been demonstrated 
through a number of experiments. 

In fact, the applications presented here 
are just a small sample of the capabilities of 
the BioWall — capabilities that we are still 
discovering. The cellular structure of the 
machine makes it an ideal platform for pro- 
totyping bio-inspired systems. 

The size and structure of the BioWall 
impose a certain number of limitations, such 
as clock speed. But its complete programma- 
bility provides outstanding versatility, and 
the visual and interactive components of the 
system are invaluable tools both for the dis- 
semination of ideas as well as the verification 
of research concepts often limited to soft- 
ware simulations. 

Some of the other bio-inspired applica- 
tions we have implemented or plan to imple- 
ment on our machine include L-systems, ant 
simulations, predator-prey environments, 
other kinds of CAs, and more conventional 
artificial neural networks. 

We invite you to come and “play” with the 
machine at one of the events at which it will 
be on public display, or even at our laboratory. 

Finally, we are extremely interested in 
putting our machine at the disposal of other 
research groups interested in hardware real- 
izations of their ideas and concepts. For 
more information, please visit us at 


Lslwww.epfl.ch/biowall!. & 


[Daniel Mange, André Stauffer, Fabien Vannel, 
André Badertscher, and Enrico Petraglio also 
contributed to this article.] 


Photos: © André Badertscher, Alain Herzog, EPFL. 


Xcell Journal 45 





Formula! Racing: 
The Xilinx Advantage 


The BMW WilliamsF1 team litts the hood on a 


its fitth-generation vehicle control system. 
aye 
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A modern Formula 1 (Fl) car communi- 
cates an enormous amount of data to the 
engineering teams that support it. How the 
teams collect, analyze, and react to that 
data is vital to competitiveness. The collect- 
analyze-and-respond process is also subject 
to change, as the sport’s rule-makers aim to 
reduce onboard electronics to control costs 
and achieve closer racing competition. 

Throughout the last decade, the electron- 
ics department at BMW WilliamsF 1, led by 
Steve Wise along with engineers Dave 
Walker and Ian Cartwright, can claim to 
have been in the driver’s seat as onboard elec- 
tronics have spread to every extremity of a 
Formula 1 chassis. 

The team is now running its fifth-gener- 
ation VCM (vehicle control and monitor- 
ing) unit; the current Stage 5 VCM uses a 
Xilinx Virtex™M-E XCV600-E FPGA along- 
side a Texas Instruments™ DSP. Shown in 
Figure 1, the VCM controls all aspects of the 
2003 FW25’s chassis (other than the 
engine). And the team is continuing to 
extend and refine the unit’s capabilities. 


Adapting to Rule Changes 

Rules change quickly in F1 racing. Teams 
must be able to respond with a competitive 
solution that meets every new detail. The 
regulatory landscape has been especially 
volatile over the last decade. Driver aids 
such as active suspension, anti-lock brak- 
ing, traction control, and launch control 
all have been banned at one time. Some 
driver-assist features have been granted a 
reprieve for this year — only to be com- 
pletely outlawed for the 2004 season. 

The ability to quickly add and remove 
functional blocks is therefore critical to 
each team’s ability to compete and to 
demonstrate compliance. 

Successive generations of the BMW 
WilliamsF1 VCM have made great use of 
FPGAs to achieve rapid compliance. This 
flexibility has allowed BMW WilliamsF1 
to migrate more and more sophisticated 


functions into hardware as new generations 
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Figure 1 — The VCM controls all of the race car’ chassis functions except the engine. 


of FPGA silicon offer extra capacity and 
complexity. 


Controlling an Fl Car 

The VCM is a data logger and processor on 
a grand scale. Its vehicle control functions 
include overseeing the hydraulically actuat- 
ed gear change, which calls for the VCM to 
initiate a gear shift either in response to a 
driver request or automatically. 


Shifting Gears 

The gear change sequence in a F1 gearbox 
requires precise control over the positions 
of the gearbox actuators, as well as control- 
ling the clutch and coordinating engine 
revolutions per second with the BMW 
engine controller. A typical gear change 
takes less than 50 ms. The system also pre- 
vents the driver from damaging or over- 
revving the engine. 

For the 2003 racing season, the VCM is 
allowed to initiate all gear changes with no 
input from the driver. But automatic gear 
changes will be outlawed for 2004, requir- 
ing the driver to initiate each gear change 
manually. Once the driver has initiated the 
change, the VCM will be allowed to handle 


the rest of the gear change sequence. 


Traction 
The VCM also handles traction control by 


performing calculations based on complex 


tire models to predict the amount of wheel 
slip required to achieve maximum traction 
and minimum tire wear. The VCM calcu- 
lates control targets and sends this data to 
the engine controller. 

Generating control signals for the 
hydraulically actuated differential also 
comes under the VCM’s domain — with the 
aim of providing maximum traction from 
each of the rear wheels to optimize stabili- 
ty of the car when cornering. 


Launch Control 

The use of “launch control” — which opti- 
mizes the car’s standing start as it leaves the 
grid — has received much press coverage in 
recent seasons. Incoming rules will outlaw 
this feature, but for the 2003 season, the 
launch start controller is a functional block 
within the VCM. Clutch and engine tar- 
gets communicated by the VCM help the 
car to accelerate from the grid to 100 m/hr 
in less than three seconds. 


Real-Time Feedback 

The VCM displays driver information sig- 
nals and warning indicators on the steering 
wheel. Even though a modern F1 steering 
wheel can cost upwards of $33,000, there is 
relatively little intelligence onboard — that is, 
until next years BMW WilliamsF1 team 
adds a Xilinx FPGA to perform some of the 
instrumentation processing functions locally. 
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Logging and Telemetry 

The VCM logs around 220 channels of 
data at rates as high as 1 KHz. As many as 
90 of these channels are signals from sen- 
sors used to analyze the car’s per- 


formance over an entire race or 





test distance. Some channels 
monitor the actions of the con- 
trol software, while others moni- 
tor the drivers inputs. An 
example of the logged data is 
shown in Figure 2. 

Each of the Stage 5 VCMs fit- 
ted to the two FW25 cars of driv- 
ers Juan Montoya and Ralf 
Schumacher can log 256 MB of 
data generated by the cars during 
a race. This data is stored on a 
standard, commercial Compact- 
Flash™ (CF) card mounted per- 
manently inside the VCM, which 
provides convenient and cost- 
effective non-volatile storage. The 
CF card contents are downloaded 
by wire link after the race for analysis. 

A subsection of the logged data is also 
transmitted via serial link to a telemetry 
transmitter on the car. This transmits the 
data in real time back to the garage where 
it is displayed on a PC, enabling techni- 
cians to monitor a car’s performance from 


the pit. 


Failure Analysis and Reporting 

Embedded in the VCM are functions to 
detect failures of sensors on board the car. 
The output of these routines prevents 
failed sensors from being used by the con- 
trol algorithms, and allows problems to be 
identified quickly. 

The failure of a small sensor cannot be 
allowed to ruin a driver’s race. To prevent 
this, the VCM monitors inputs from mul- 
tiple sensors and acts on the majority deci- 
sion in each case. 

The level of redundancy is determined 
according to the importance of the func- 
tion, the vulnerability of the sensor, and 
the potential weight penalty. The VCM is 
capable of monitoring as many as 100 
input sensors, while controlling numerous 
hydraulic systems. 

This failure detection capability is aug- 


48 Xcell Journal 


Se) eee oe 


mented by pit-to-car telemetry. The race 
engineer can arbitrate against a sensor or set 
of sensors believed to be providing inaccu- 
rate signals. 
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Figure 2 — Data from more than 90 sensors monitor the car’s performance. 


Extra Processing Horsepower 

BMW’’s engine designers would say that a rac- 
ing car can never have too much horsepower. 
BMW WilliamsF1’s electronic designers have 
a similar opinion about processing capability: 
More MIPS (million instructions per second) 
allow them to consolidate functions within 
the VCM, control more aspects of the car, 
and use more sophisticated models to gener- 
ate accurate control signals. 

Despite the rule-makers’ attempts to 
simplify and reduce the cost of F1 technol- 
ogy, the processing demands on the VCM 
continue on a steeply upward trend. 

Paradoxically, rule changes for 2004 pro- 
hibiting pit-to-car telemetry, which are 
intended to reduce the complexity of elec- 
tronics, will drive the processing load higher 
in future years. The decision-making per- 
formed by human operators in the garage 
must now migrate to the car itself. 

Dave Walker and his team are faced 
with an ever-increasing demand for pro- 
cessing power. But they do not enjoy a cor- 
responding increase in thermal budget or 
physical size limits. The growing list of 
duties for the VCM, originally used only as 
a pure data logger, has increased processing 


requirements. 
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Hardware Solution 

With 220 channels to monitor, as well as 
numerous control algorithms to execute, 
Dave Walker says processor cycles are far 
too precious to waste on 
basic input functions, such 
as filtering or controlling A- 
to-D conversions. 

Among the potential 
solutions to reducing proces- 
sor load, the BMW 
WilliamsF1 team did not 
favor a microcontroller 
because of the need to write 
software and maintain the 
code base. In addition, the 
software inspection demands 
imposed by the FIA make 
the use of a microcontroller 
even less desirable. 

A pure hardware solu- 
tion is more optimal, 
according to Walker. Before 
the advent of modern 
multi-MFLOP (mega floating-point opera- 
tions per second) DSPs, it was only possi- 
ble to meet the timing demands for data 
logging and controlling high-speed 
hydraulic actuators by using hardware to 
accelerate certain functions. 

The team has used field programmable 
gate arrays from a variety of vendors since 
the first-generation design, but BMW 
WilliamsF1 has now been working exclu- 
sively with Xilinx for a number of years. 
The current Stage 5 VCM uses the Xilinx 
Virtex™M-E XCV600-E FPGA alongside a 
Texas Instruments™ DSP. 

Although processing capabilities of 
DSPs have advanced dramatically since the 
first versions of VCM entered service, the 
team finds the hardware processing capa- 
bilities of the companion FPGA a valuable 
resource, which allows it to focus on using 
the DSP for pure number crunching. 

There will never be a surfeit of process- 
ing power, as this can always be consumed 
by the increasingly sophisticated models 
provided to the VCM. These models 
demand more accurate control outputs for 
gearbox, differential management, and 
communications with the engine. Today’s 
XCV600-E is around 85% utilized. The 
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team plans to migrate to a more complex, 
next-generation Virtex-II Pro™ FPGA in 
the VCM incarnation for 2004. 

The team also plans to use Xilinx 
Titanium and Platinum technical support 
packages, which will include on-site design 
and training by expert Xilinx engineers. 

Accordingly, the sixth-generation BMW 
WilliamsF1 VCM will perform even more 
processing, with a more powerful DSP chip 
alongside a more complex Xilinx FPGA. 
The Virtex-I] Pro™ platform FPGA will 
also be able to offload additional DSP 
functions by virtue of its onboard multiply- 
and-accumulate (MAC) block to deliver a 
greater boost in effective system MIPS. 

For 2004, the Virtex-II Pro device will 
host new programmable filtering and 
communication protocol control func- 
tions. The platform FPGAs will also man- 
age UARTs and PWMs (pulse-width 
modulators) that were formerly imple- 
mented in separate components on the 
printed circuit board. 

This ability to progressively soak extra 
discrete logic into the FPGA — at the same 
time as assimilating software routines — has 
enabled the VCM’s footprint to shrink sig- 


nificantly over its lifetime. 
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Conclusion 

For those determined to win, Formula 1 has 
always demanded the utmost speed and 
nimbleness — not only on the racecourse, 
but also in terms of technical innovation. 
Rule changes introduced at the start of 2003 
have taxed all teams’ abilities to respond 
quickly and effectively. 

At the vehicle control level, engineers must 
to be able to add and remove functional 
blocks from the VCM within a short time- 
frame to meet FIA regulations. The teams 
must also be able to incorporate performance 
modifications as the current season progresses. 

Fast feature swapping — combined with the 
ability to consolidate more of the low-level 
hardware and discrete logic into an FPGA to 
save weight, space, and power dissipation — is 
crucial to BMW WilliamsF1’s ability to 
demonstrate compliance with FIA rules. 
Moreover, Xilinx FPGA reprogramability 
enables the team to innovate quickly to main- 
tain a competitive edge over rival teams. 

To learn more about the Xilinx partnership 
with the BMW WilliamsF1 team, go to 
www.bmw.williamsfl.com and click on 
“Xilinx Joins the Team” under Partner News. 
To get the latest Formula 1 standings, go to 
www.formulal.com. & 








As of this writing (July 8), the BMW 
WilliamsF1 team had back-to-back 
wins, taking first and second place at 
both the European Grand Prix at 
Nurburgring and the French Grand Prix 
at Magny-Cours. Earlier this year, the 
team posted its first win of the season in 


the Monte-Carlo Grand Prix at Monaco. 


According to a July 7 Formula 1 
press release: “Williams has suddenly 
vaulted past both McLaren and Ferrari 
to establish itself as the team most in 

form ... If proof were needed of the 
growing Williams menace, in the last 
five races McLaren, initially the 2003 
series leader, has scored 34 points; 
Ferrari has scored 55; and 
Williams has scored 70." 


In overall standings, BMW WilliamsF1 


was trailing the series leader Ferrari 


by just three points. 
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by Felix Meier 
Senior Engineer 
Girsberger Elektronik AG 
felix.meier@smile.ch 


Every year in Switzerland, approximately 
27 winter sports enthusiasts lose their 
lives in avalanches. If caught in an ava- 
lanche, your life depends upon the speed 
of your companions as rescuers to locate 
you and dig you out. After only 30 min- 
utes, your chance of survival is just 40%. 
Speed is of the essence. 

One rescue tool is an avalanche bea- 
con, a small portable unit that transmits a 
beep about every second. Using one sig- 
nificantly increases the chances of being 
quickly and successfully rescued if buried 
by an avalanche. Your companions need 
only to switch their units to “receive” 
mode and locate your beeps. 

As with any rescue and survival skill, 
the correct use of avalanche beacons 
requires education and _ practice. 
However, for use in classroom situations, 
the signals from regular avalanche bea- 
cons are too strong to allow for a mean- 
ingful demonstration of the basic search 
principles. 

At Girsberger Elektronik AG, we 
developed a special reduced-strength 
transmitter for search training (Figure 1). 
It meets our key requirements: low cost, 
battery efficient, and compatible with the 
various timing signals (that is, the interval 
and duration of beeps) of all beacon 
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brands on the market. We use a 73.768 KHz 
watch oscillator followed by a Xilinx 
CoolRunner™-II device to generate the 
timing signals (Figure 2). This provides an 
optimal solution, saving on both power 
consumption and PCB real estate. 





Figure 1 — Training transmitter 


1/0 Port Expanders 
The best practical training application for 
beacons is, of course, to bury the transmitters 
into mountain snow. However, changing the 
location of the transmitters to build new 
search scenarios cost valuable training time. 
By designing a control console (Figure 3) for 
as many as 16 remote transmitters activated 
via radio link, we enabled trainers to quickly 
and easily simulate new accident situations. 
In order to handle 16 remote transmit- 
ters we required more I/O lines than what is 
available on a microprocessor. We chose to 
use two Xilinx CoolRunner-II devices, 
which provide a compact solution with low 
console battery consumption. 


PCM Codec to RAM Interface 

We also found CoolRunner-I] CPLDs 
effective in reducing transmission loss. To 
initiate transmission over a private mobile 
radio, you must press a PTT (push to 
talk) button and wait before the transmit- 
ter is ready for you to start speaking. If 
you're under stress, there’s a strong ten- 
dency to start talking before the transmit- 
ter is ready, and the beginning of your 
transmission will be lost. You may have to 
repeat the whole transmission in order for 
the receiver to make sense of it. This is 
inconvenient and, especially in emer- 
gency situations, can cause additional 
stress and frustration. 
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Figure 2 — Internal view of training transmitter 
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Figure 3 — Transmitter control console 
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Figure 4 — Digital audio delay line 


In some cases the PTT button is 
replaced by a voice activity detector, which 
has an inherent turn-on delay, meaning that 
the beginning of your transmission will 
always be lost. 

To eliminate loss at the beginning of 
transmissions, we combined a PCM codec 
with a static RAM to implement a digital 
audio delay line, which holds up the voice 
signal (Figure 4). 

The PCM data transfers in to and out of 
the codec over a high-speed 2.048 MHz 
serial bus (Figure 5). Encoded PCM samples 
are converted to parallel data and written 
into a ring buffer in the RAM. By reading 
parallel data from the RAM location with a 
fixed offset (modulo the size of the ring 
buffer) to the write address, the output is 
delayed. The parallel data is then reconvert- 
ed and fed into the codec over a serial bus. 


in 
analog 
out 


— 


. 
. 2.048 Mb 





Figure 5 — Codec to RAM 
interface block diagram 
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One PCM sample reads/writes every 125 
microseconds. The delay can be adjusted by 
modifying the read-to-write address offset. 

These operations, in particular the 
emulation of the serial bus, cannot be per- 
formed by a standard microprocessor, 
which would be too slow. We chose a 
Xilinx CoolRunner-II device with 64 
macrocells for the interface between the 
codec and the RAM. This solves our speed 
problem and also saves PCB real estate that 
would have otherwise been taken by stan- 
dard logic components. 


The Development Process 

Excellent support from Impact Memec, 
the local Xilinx representative, kept the 
development start-up time extremely 
short. We used ISE WebPACK™ software 
for all of the designs, and found it espe- 


a 
x 
& 


AD(12:0 


control 





delay 


cially helpful for the following reasons: 


¢ Access to error messages on the Xilinx 
website provided instant in-depth 
information about the particular mes- 
sage, often with hints on how to solve 


the problem. 


¢ The designs were simulated using 
ModelSim XE Starter, leading to some 
design modifications and allowing flaws 
to be fixed early in the development 
process. Some problems were due to 
signal spikes on clock lines, which were 
properly indicated by the simulator. 


All designs loaded and operated cor- 
rectly on the first attempt when moved to 
the target hardware. We have no doubt 
that the design environment of the Xilinx 
CoolRunner-II family of CPLDs is mature 
and sound. 


Conclusion 

Xilinx CoolRunner-II devices are well 
suited for our applications. They provide 
a reliable technical solution combined 
with a short development time and a low 
start-up cost. 

Based on this positive experience, 
Girsberger Elektronik AG is planning to 
incorporate Xilinx CoolRunner-II devices 
in such products as: 


¢ The upgrade and replacement of an old 
avalanche beacon ASIC to eliminate 


the high NRE cost. 


¢ A beacon to locate stolen cars. By tak- 
ing over part of the training transmitter 
CPLD design, we will save significant 
development time. 


¢ A data over voice interface for use on 
ISDN B-channels. The CPLD beats 
the speed of any microprocessor in 
manipulating the bitstream on an 
industry-standard IOM bus. 


Although CPLDs are often associated 
with high speed or high density, Xilinx 
CoolRunner-II devices are an interesting 
alternative for small-scale applications in 
which top speed and high density are not 
key concerns. For more information 
about Girsberger Elektronik AG, visit 
www.girsberger-elektronik.chl. & 
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Boeing Harnesses 
Virtex-Il Pro Plattorm 
for Image Processing 


Designing a state-of-the-art airborne imaging system requires 
massive parallel processing, high-speed |/Os, and heavy-duty 
signal processing. Boeing's design team found all three plus 

configurability in the Virtex-lI Pro Platform FPGA. 


alll 





a 


Summer 2003 





by Christopher Musial 

Real Time Systems Engineer 

The Boeing Co. 
christopher.j.musial@boeing.com 


Katie DaCosta 

Solutions Marketing Manager, DSP Solutions 
Xilinx, Inc. 

katie. dacosta@xilinx.com 


Over the past 10 years, the aerospace 
industry has adopted the use of commer- 
cial off-the-shelf (COTS) electronics in 
favor of developing custom systems. This 
change has been fueled in part by the high 
development costs associated with ASICs 
or specialized software processors. 

As a result, commercial suppliers seek- 
ing the best available technology have 
focused on the design and fabrication of 
multi-processor hardware. Today, many of 
these systems feature DSP devices or 
RISC-based CPUs. Although Boeing has 
successfully integrated these solutions into 
many systems, the application of COTS to 
high-end signal- and image-processing 
designs has remained very challenging. 

Even “high-end” DSP and. RISC archi- 
tectures offer limited parallelism, memory 
flexibility, interface capability, and determin- 
ism. The integration of dozens of processors 
in a hard real-time architecture is non-trivial. 
The complications of packaging, weight, vol- 
ume, thermal management, and power con- 
sumption to system design are obvious. The 
challenges often appear to grow exponential- 
ly with the number of processors required to 
solve the problem. The advent of Virtex-II 
Pro™ Platform FPGAs and the promise of 
more advanced, future Platform FPGAs are 
completely changing the game of program- 
mable system design. 


Airborne Imaging 
Boeing is designing a state-of-the-art air- 


borne, electro-optical imaging system. 
The end product will provide exceptional 
surveillance and engagement capability 
for aircraft in the 21st century. To achieve 
these objectives, Boeing designers are 
integrating a broad array of technologies 
into a complete programmable system. 
Key design elements include: 
¢ A high-resolution gimbaled telescope is 
driven by line-of-sight control electron- 
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ics. This system implements control algo- 
rithms that accept operator inputs to 
point the telescope to ground-based 
objects of interest. 


¢ Precision optics direct the light collected 
by the telescope into a series of imaging 
cameras. Aircraft engine vibration, struc- 
tural resonance, and other disturbances 
can wreak havoc on the alignment of 
these devices. To compensate for these 
effects, an automatic alignment system is 
incorporated. The goal is to stabilize the 
imaging system so it can yield a clear, jit- 
ter-free image for the system operators. 


Sensitive imaging cameras act as the 
“eyes” of the electro-optical payload. 
These devices provide high-resolution 
daytime and nighttime video to real-time 
image processing hardware. 


Sophisticated image-processing algorithms 
that perform video enhancement, segmen- 
tation, and feature extraction. 


Together, these subsystems allow an opera- 
tor to identify and interrogate ground-based 
objects with unprecedented capability. Figure 1 
highlights some of the essential functions of 
this system. 


Boeing Subsystems 

Boeing has chosen the Virtex-II Pro Platform 
as a critical element in this programmable 
system. The FPGA device and the support- 
ing chips connected to it execute the follow- 
ing functions: 


¢ High-speed image processing 


¢ Digital-video scene generation 
and storage 


¢ Fiber-optic media conversion of 
video data 


¢ DVI display generation 
e NTSC video generation 
¢ Servo-control functions 


¢ High-bandwidth data transfer 


interfaces. 


Embedded DSP 

One reason Boeing chose the Virtex-II Pro 
platform is that it incorporates three of the 
most important elements for a modern, 


embedded DSP device. 
54 
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Figure 1 — Essential 


system functions 


. A high-performance, high-density, 


programmable logic fabric — this capa- 
bility is the heart and soul of true par- 
allel processing. FPGA technology 
lends itself very well to the repetitive, 
systolic nature of our algorithms. 
Embedded multipliers, SRL16 capabil- 
ities, true dual-port Block RAM, and a 
segmented routing architecture give us 
key enabling technologies to produce 
powerful designs. 


. A wealth of interface pins that can be 


configured for different signaling stan- 
dards and easily integrated with devices 
from other vendors — using a single 
Xilinx chip, Boeing’s design team can 
arrange a device with a PCI bus inter- 
face, video encoder/decoder connec- 
tions, digital-video display outputs, 
multiple SRAM interfaces, and a high- 
speed digital-video input port. These 
functions run in parallel, concurrently, 
without the hassles of bus contention 
and multi-processor scatter/gather issues. 


The Virtex-II Pro multi-gigabit trans- 
ceivers (MGTs) raise the bar to anoth- 
er level. MGTs are extremely useful for 
the imaging camera interfaces. Boeing’s 
design team can use a low-end Virtex- 
II Pro device to serialize several parallel 
digital video outputs. These serial out- 
puts are easily converted to fiber-optic 
media. And because the data transfer is 
point-to-point, designers can program 
the serial transceivers with a user- 
friendly data protocol. 
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3. An embedded IBM PowerPC™ 405 
CPU — the point of using an FPGA 
in many image-processing applica- 
tions is to unburden the host CPU 
from simple, brute-force calculations. 
With the embedded PowerPC CPU, 
the host is left unimpeded to handle 
the complex decision-making logic 
that forms the brains of a system. 


PowerPC Advantages 

Many people might argue that it’s easy 
enough to hook an FPGA to a PCI bus 
interface — so why bother with the embed- 
ded PowerPC CPU in the Virtex-II Pro 
platform? For some applications, a PCI 
interface is sufficient, but Boeing’s design 
team has found many compelling reasons 


to choose an embedded CPU: 


¢ By using logic to accelerate the embed- 
ded CPU, most of the residual back- 
end processing for our applications 
requires only a modest amount of 
compute power. We all like having 
multi-GHz CPU power, but it is not 
always necessary. 


The embedded PowerPC CPU con- 


sumes a fraction of the power of a high- 


end CPU, while remaining directly 
coupled to the logic fabric. In Boeing’s 
business, it seems that Moore's law does 
not always apply. RISC CPUs for 
embedded applications have not general- 
ly kept pace with desktop systems in 
terms of clock speed and performance. 
Packaging and thermal dissipation con- 
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straints make it very hard for embed- 
ded. hardware developers to offer 
multi-GHz-class processor subsystems 
in a small form factor. DSP processors 
address many power consumption 
issues, but lack the logic interface 
flexibility. For many applications, 

the embedded PowerPC provides the 
right balance of performance, power, 
interface, and general-purpose com- 
puting capability. 


¢ Those who think PCI is easy should 
try using the CoreConnect™ bus with 
dual-port RAMs. In a matter of min- 
utes, you can configure an FPGA 
interface to a 405 processor via high- 
speed BRAM. It’s like having your data 
put right into the cache RAM. Now 
consider that multiple, independent 
FPGA processes can stuff these RAMs 
in parallel — and with completely dif- 
ferent clocks than those used by the 
PowerPC CPU. You get high-band- 
width, extremely simple data transfer 
without the vices or overhead of PCI. 
Put simply, there is far more room for 
creative interfaces, application-specific 
tailoring, and flexibility in the Virtex-II 
Pro platform. 


Consider your application model for 
your microprocessor. Boeing's design 
team gets plenty of mileage out of 
using the embedded CPU as a “micro- 
controller.” If you dispose of a full- 
blown operating system and run the 
PowerPC CPU with a lightweight ker- 
nel, the internal Block RAM can hold 
a significant amount of user code. 
Even better, the FPGA is now both 
hardware and software reconfigurable — 
in real time. A single unified bitstream 
defines the operating characteristics for 


both the CPU and logic. 


Major Features 
In addition to the device’s embedded DSP 
capability, Boeing’s design team makes use of 
almost every major feature of the Virtex-II 
Pro logic fabric: 
¢ The embedded multipliers are very 

useful for DSP-based algorithms. 

They provide high silicon efficiency 

and effortlessly support the clock rates 

our designs require. 
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Figure 2 — Before and after 
Virtex-LI Pro implementation 


The dual-port Block RAM is a veritable 
Swiss Army knife in terms of function- 
ality. It can be used for tapped delay 
lines, command and status data buffers, 
lookup tables, bus-width conversion, 


FIFOs, and so forth. 


The SRL16 capability with dedicated 
fast-carry logic gives the Virtex-II Pro 
platform a key advantage in DSP 
applications. Both features are critical 
to the design of high-performance 
digital filters and the SRL16 mode 
helps to significantly reduce the logic 
footprint of DSP algorithms. 


DCI (digitally controlled impedance) 
technology is a significant aid when 
implementing functions such as DDR 
RAM interfaces. Boeing’s PCB designer 
no longer struggles with finding a way to 
place hundreds of termination resistors. 


The DCMs (digital clock managers) 
provide an easy-to-use, flexible clock 
management scheme. Our design team 
has leveraged both the frequency syn- 
thesis and precision phase shifting capa- 
bilities in many designs. The new ISE 
architecture wizards make instantiating 
them easy. 


The MGTs allow us to create high- 
bandwidth interfaces to a high-speed 
serial link that operates as if it were a 
parallel data bus. Ninety-five percent of 
the design problems in this application 
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are simply about sending pixel data from 
A to B. Rapid IO, 3GIO, InfiniBand, or 
Fibre Channel are generally overkill for 
this type of data transfer requirement. 
The MGTs provide serial interfaces in a 
way that has reduced board size tremen- 
dously. The 8b/10b and SerDes (serializ- 
er/deserializer) chips that we used in a 
prior design were actually bigger than an 
entire Virtex-II Pro device. 


Figure 2 shows how the Virtex-II Pro 
platform enabled Boeing’s design team to 
dramatically reduce system size and cost. 

Boeing’s high-performance imaging 
system is taking advantage of the complete 
Xilinx solution. We are using most of the 
features in the silicon as well as Xilinx 
handcrafted IP cores, System Generator 
for DSP, and CORE Generator™ tools. 
These tools enable us to focus on the spe- 
cific elements of the design and spend less 
time constructing the architecture that 
surrounds it. 


Conclusion 

Virtex-II Pro Platform FPGAs raise the bar 
of both capability and complexity for pro- 
grammable system design. Boeing’s design 
team chose Xilinx solutions because the 
Virtex-II Pro Platform FPGA 
unmatched “system” capability along with a 


offers 


broad range of devices from which to 
choose. For more information on Virtex-II 
Pro Platform FPGAs and design resources, 
go to www.xilinx.com/virtex2pro. & 
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Xilinx Virtex-II Pro™ 
Development Kit 






microprocessor and I/O intensive applications. 


Part Number 
ADS-XLX-V2PRO-DEVP7-5 


Description 


Xilinx Virtex-II Pro Development Kit, 
with XC2VP7, -5 speed grade 


Xilinx Virtex-II Pro Development Kit, 
with XC2VP7, -6 speed grade 


ADS-XLX-V2PRO-DEVP20-5 Xilinx Virtex-Il Pro Development Kit, 
with XC2VP20, -5 speed grade 


ADS-XLX-V2PRO-DEVP7-6 


Part Number 
ADS-XLX-SP2E-EVL 
ADS-SP2E-MB-EVL 


Description 
Xilinx Spartan-Il Evaluation Kit 


Xilinx Spartan-lIE Evaluation Kit bundled 
with Communications/Memory Module and 
MicroBlaze Core License 


Other Evaluation Kits Available from ADS: 
Denali RLDRAM IP Core Demonstration Kit 


Part Number 
ADS-DLI-RLDRAM 


Description Resale Price 


Denali RLDRAM IP 
Core Demo kit 


Contact your local Avnet 
FAE for more information. 


Based on the Virtex-ll architecture, this kit provides 
a unique environment for developing high-performance 


Resale Price 
$1,995.00 USD 
$2,495.00 USD 


$2,995.00 USD ADS-XLX-MB-DEV1500 


Xilinx Spartan-IIE™ Evaluation Kit 

Designed to develop and test designs targeted to the Xilinx 
Spartan-lIE FPGA family, this kit is optimized for the low-cost, consumer 
digital convergence market and can include a MicroBlaze Core License. 


Part Number 
ADS-XLX-V2-DEV1500 


Description 


MicroBlaze Core License 


> 


DS-XLX-V2-DEV4000 
DS-XLX-MB-DEV4000 


> 


MicroBlaze Core License 


>| > 








> 


Xilinx Virtex-I] Development Kit populated with an XCV1500 device 
Bundled with Communications/Memory Module and 


ADS-V2-MB-DEV1500XP —_ Bundled with Communications/Memory Module, 
MicroBlaze Core License and high-current power supply 


Xilinx Virtex-I] Development Kit populated with an XC2V4000 device 
Bundled with Communications/Memory Module and 


DS-XLX-V2-DEV4000XP XC2V4000 and high-current power supply 


DS-V2-MB-DEV4000XP = Bundled with Communications/Memory Module, MicroBlaze 
Core License and high-current power supply 


ADS-XLX-V2-DEV6000XP Xilinx Virtex-I] Development Kit populated with an XC2V6000 
and high-current power supply 


DS-V2-MB-DEV6000XP = Bundled with Communications/Memory Module, MicroBlaze 






Xilinx Virtex-II"™ Development Kit 

Designed to develop and test designs targeted to the 
Xilinx Virtex-Il FPGA family, this kit offers up to a 6 million system 
gate prototyping environment and can include a MicroBlaze Core License. 


Resale Price 
$1,000.00 USD 
$1,400.00 USD 


$1,900.00 USD 


$2,500.00 USD 
$2,900.00 USD 


$3,000.00 USD 
$3,400.00 USD 





$6,000.00 USD 
$6,400.00 USD 





Core License and high-current power supply 


w 







Resale Price 4 
$249.00 USD P —_ 
$650.00 USD 


Audio/Video Module 
The Audio/Video Module is an expansion 


daughtercard and offers a platform to develop and 


test products that require audio and/or video. 


Part Number Description 
ADS-AV-DAU Audio/Video Module 


Communications/Memory Module 


Part Number Description Resale Price 
ADS-EMU-DAU Communications/Memory Module $250.00 USD 
ADS-XLX-MB-DAU Communications/Memory Module $499.00 USD 


bundled with MicroBlaze 
Core License 


Resale Price 
$300.00 USD 


Motorola ColdFire® Evaluation Kit 


Part Number Description Resale Price 
ADS-MOT-5282-EVL Motorola ColdFire Evaluation Kit $250.00 USD 
ADS-MOT-5282PE-EVL Motorola ColdFire Evaluation Kit $300.00 USD 


with P&E BDM interface 


™ 
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Avnet Design Services 
Accelerates Design Cycles 


Avnet Design Services (ADS), the technical arm of Avnet Cilicon, 
provides a variety of technical resources and capabilities to speed 
your design cycles. From concept to reality - ADS can help get 
your products to market faster with: 


¢ Reference designs 

¢ Development & evaluation kits 

¢ Fee-for-service FPGA & ASIC designs 

¢ Support of Field Applications Engineers (FAEs) 

¢ Product & system-level architecture selection assistance 
¢ IP evaluation 


Partnering with Avnet Cilicon means working with a company 
that is dedicated to giving you Support Across The Board.™ 


For more information or to purchase an evaluation kit, 
visit www.avnetavenue.com/xcell 





Ready. 
Set. 
Go to market™ 
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1 800 332 8638 
www.avnetcilicon.com 
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A new low-cost development kit for Xilinx 
CoolRunner-lI CPLDs helps experts and novices 
alike get the most trom their designs. 
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by Steve Prokosch 
CPLD Marketing Manager 
Xilinx, Inc. 
steve.prokosch@xilinx.com 


The new CoolRunner™-II RealDigital 
CPLD Design Kit contains everything you 
need to explore all the new capabilities of 
the CoolRunner-II RealDigital CPLD. 
With this kit, you will be able to do every- 
thing necessary to test your designs in the 
industry's best 1.8V CPLD. 

For low-power/high-speed designs, a 256 
macrocell CoolRunner-II device in a 144- 
pin thin quad flat pack (TQ144) package is 
included with every kit. If you need more 
programmable logic or want to try a different 
Xilinx CPLD, an additional socket accepts a 
CoolRunner-II 64 or 32 macrocell device for 
other low-power/high-speed designs. 

Want to test run a 2.5V or 3.3V Xilinx 
CPLD? This socket also accepts Xilinx 
best-selling XC9500XL or XC9500XV 
devices in 36 or 72 macrocell sizes. 

The CoolRunner-II CPLD design kit 
board (Figure 1) can be powered either by a 
standard power supply or by two AA batter- 
ies (battery holder included). The kit also 
includes a parallel cable for programming 
your CPLD. The cable simply goes from 
your PC parallel port to the parallel port 
connection on the board. 

To help you get started, we have 
included a Programmable Logic Quick 

Start Book that contains installation 

instructions for the ISE WebPACK™ 
software, as well as step-by-step 
instructions on how to use, enter, and 
synthesize designs, and how to target 
Xilinx programmable logic devices. 


Fall 2003 





To give you a good jump-start on designing 
your own applications, we give you design 
examples in both schematic and high-level 
language, such as VHDL. 

The kit comes with a resource CD that 
contains a great deal of useful information: 


¢ Reference designs in VHDL, user con- 


straint file (UCF), and HDL design files 


¢ Instructional video demonstrating the 
entire process, from design to program- 
ming of the device 


¢ ISE WebPACK overview and software 
options for different Xilinx devices 


¢ Links to related websites 


¢ Application notes, data sheets, power 
consumption comparison, product 
pages, articles, and other product 


presentations 


¢ Technical presentations and 
packaging information 


¢ Top 10 FAQs and where to find 


other answers to questions 


¢ Product brochures, reference 
manuals, and product briefs 


¢ Video featuring Xilinx CEO 
Wim Roelandts. 


Simple Tutorial, Advanced Features 

Whether you already design with program- 
mable logic or would like to begin, this kit 
can help you. The accomplished designer 
will appreciate the many advanced features 
and new interfaces on CoolRunner-H, 
including high-speed transceiver logic 
(HSTL) and stub-series-terminated trans- 
ceiver logic (SSTL). 

Or you might experiment with the 
other new memory device interfaces not 
supported on your current microprocessor. 
A breadboard area allows you to add com- 
ponents for use in future designs, and you 
can also connect a daughtercard to the 
CoolRunner-II design kit board via the 
dual inline headers. 

Designers new to programmable logic 
can use the demo board with its tutorial 
to quickly get up to speed on tool usage. 
The demo covers all aspects of designing 
a CPLD, from design entry with 
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WebPACK, through simulation, to pro- 
duction of a JEDEC file used to program 
the device. This straightforward tutorial 
should get you up and designing within 
hours, and can also be used to encourage 
lab technicians to explore their own 
design aspirations. 

The demo board is handy to have in 
your lab to test new design ideas for imple- 
mentation on current products. If your 
system already includes a CPLD, it serves 
as a nice test bed for running upgrade 
tests. If power is your concern, the 





Figure 1 — CoolRunner-II design kit board 


CoolRunner-II development board is espe- 
cially useful for running different designs 
to test power consumption. If you want 
some ideas about reducing power con- 
sumption, check out the low-power tips 
and tricks application note XAPP346 on 
the CLPD application notes webpage 
(www.xilinx.com/appslepld.htm — see Table 1 
for a list of all of the application notes 
described in this article). 

The 256-macrocell (XC2C256) CPLD 
in a TQ144 — with room for other devices 
— gives you a good sense of design capacity. 
With this device configuration you get 118 
I/Os, two I/O banks, clock doubling/divi- 
sion, and voltage-referenced I/O. If you 
need. voltage-level translation, this device 


can handle 1.5V, 1.8V, 2.5V, or 3.3V I/Os. 

If your design is relatively simple, try the 
32 or 64 macrocell device. If your design is 
complex, try the 256 macrocell configura- 
tion. If you think partitioning your design 
offers the best solution, use both devices. 
This board is designed to give you the max- 
imum amount of flexibility. 


What Can You Do With It? 
With both a breadboard area and dual 
inline header sockets, the design kit board 
is suitable for developing on-board 
applications, attaching 
daughtercard boards, or 
communicating with 
another board via head- 
ers. It is large enough to 
accept many useful 
designs, including stan- 
dard bus protocols (SPI 
and [?C), 


troller interfaces, and 


microcon- 


serial communications, 
such as Infrared Data 
Association and univer- 
sal asynchronous receiv- 
er transmitter (UART) 
which have 


already been created by 


designs, 


Xilinx application engi- 
neers. These last items 
are available free by 
downloading application 


note XAPP345. 


PicoBlaze Soft Processor 

The CoolRunner-II design board is 
extremely useful for evaluating the 
PicoBlaze™ soft processor in your 
design. Available for downloading free 
from application note XAPP387, the 
PicoBlaze soft processor offers a constant 
K-coded programmable state machine, 
and is written in VHDL and C program- 
ming languages. Extremely customizable, 
both its size and functionality can easily 
be changed, making PicoBlaze an ideal 
application for content-sensitive designs. 
The PicoBlaze soft processor provides 49 
different instructions, eight 8-bit regis- 
ters, 256 directly addressable ports, and a 
maskable interrupt. If you simply want an 
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8-bit microcontroller interface, take a look 
at application note XAPP349. 


Memory Interface 

The CoolRunner-I] design board may well 
be the best solution for testing new memo- 
ry interfaces. With multiple memory inter- 
faces — including HSTL, SSTL, and 
LVCMOS — you can test SDRAM, 
SSDRAM, MSDRAM (mobile SDRAM), 
UtRAM (UniTransistor RAM), cellular 
RAM, and flash memories. You can even 
design your own memory controller for 
new flavors of portable or mobile RAM as 
they become available. 

The CPLD application notes webpage 
contains many memory design examples, 
including beginning designs for flash and 
DDR SDRAM memory. Check this web- 
site often for new memory interfaces and 
updates or to download an HDL design 
and get email notification of new applica- 


tion notes. 


Crosspoint Switch 

The non-blocking architecture allows each 
output to be independently connected to 
any input, and any input to be connected to 
any or all outputs. The double-row latch 
architecture utilized in this design allows 
switch reprogramming to occur in the back- 
ground during operation. Activation of the 
new configuration occurs with a single con- 
figuration pulse. Each output can be individ- 
ually disabled and set to a high-impedance 
state, allowing easy expansion to larger 
switch array sizes. This is covered in detail in 
application note XAPP380. 


OTF Reconfiguration 

On the fly (OTF) reconfiguration permits 
the CPLD to operate with an initial design 
pattern while simultaneously acquiring a 
second pattern. For example, your first 
design pattern can be a power-up applica- 
tion, such as a built-in self-test (BIST), and 
the second pattern can be one of normal 
device operation. If you want to run diag- 
nostics on your system on power-up, you 
would load a functional pattern to perform 
additional bus interface support. The second 
pattern can be configured into the device 
with minimal disturbance to its operation. 
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Another important consideration is that 
OTF reconfiguration enhances security, 
because switching keys on the fly permits 
robust protocols for data communication 
designs that can be tough to crack. OTF 
reconfiguration takes in-system program- 
ming (ISP) to a new level and will 
undoubtedly spawn many new applications 
that take advantage of this capability. More 
information about OTF can be found in 
application note XAPP388. 


FPGA Downloader 
Another common use for CPLDs is as an 
FPGA downloader. At power-up the CPLD 
sequentially loads each FPGA with a pat- 
tern, which is stored in exter- 
nal memory. By assigning a 
CPLD to accomplish this 
task, you can get double duty 
from the CPLD. On power- 
up, it loads the FPGA; after 


this power-up task is done, 


XAPP137 


XAPP345 
XAPP346 
XAPP349 
XAPP380 
XAPP387 
XAPP388 


you can OTF reconfigure the 
CPLD and use it as a func- 
tioning device in your 
design. It’s like getting two 
devices for the price of one. 
To learn more about this 
application, see application 
note XAPP137. 





If you do experience problems, a website 
has been set up for you at www. 


digilentinc.com to help troubleshoot the board. 


Conclusion 

CoolRunner-II RealDigital CPLDs can be 
the perfect solution for any high- 
speed/low-power design and cover many 
application areas. In addition to their fea- 
tures, these devices also come in packages 
that suit high volume and small form fac- 
tor packaging. 

To order your CoolRunner-II RealDigital 
Design Kit, just go to www.xilinx.com/cpld/ 
and click on “CoolRunner-II Design Kit 
for $49.99.” && 


Configuring Virtex FPGAs from Parallel EPROMs 

with a CPLD 

IrDA and UART Design in a CoolRunner CPLD 

Low Power Tips for CoolRunner Design 

CoolRunner CPLD 8051 Microcontroller Interface 
Building Crosspoint Switches with CoolRunner-II CPLDs 


PicoBlaze 8-Bit Microcontroller for CPLD Devices 





“On the Fly” Reconfigurations 


Table 1 — All CoolRunner-LI application notes can be 
found at www.xilinx.com/apps/epla.htm. 
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Develop Applications 
Right Out ot the Box 


Avnet's Virtey-lI Pro development kit is a full-featured hardware 
development environment that includes Linux OS, 10 Gigabit 
Ethernet modules, extensive memory, and interface support. 


by Warren Miller 

Vice President of Marketing 
Avnet, Inc. 
warren.miller@avnet.com 


One of the best ways to accelerate your 
design time is to use standard hardware 
development platforms to prototype, evalu- 
ate, and test your complex designs. With the 
right platform, you can quickly create and 
test complex hardware — a task that is much 
too difficult to do in software. 

Complex networking applications such as 
Gigabit Ethernet switches and routers 
require hardware platforms so various design 
alternatives can be evaluated. Compliance 
and field tests are also only possible with 


hardware platforms. Acceleration of these 


tasks can make all the difference in the time- 
to-market race. 
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Aynet’s development kit with a Xilinx 
Virtex-II Pro™ Platform FPGA provides 
the right mix of hardware, support, third- 
party tools, and application examples to 
allow you to create a platform for real-life 


designs. The kit includes up to a Virtex-II 
Pro XC2VP20 device and enough memory, 
peripheral devices, and interfaces to create a 
complete hardware development environ- 
ment right out of the box. 


Figure 1 — Avnet-designed Virtex-II Pro XC2VP20 development board 
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10 Gigabit Ethernet Designs 

Many features of the Avnet Virtex-II Pro 
development board, shown in Figure 1, 
are especially useful to support 10 Gigabit 
Ethernet designs. The Virtex-II Pro 
device, an XC2VP20-FF896, provides the 
logic capacity needed for 10 Gigabit 
MAC and other supporting IP cores. 
Extensive on-board memory provides suf- 
ficient storage for both code and data 
embedded IBM™ 


PowerPC™ processors found on the 


space for the 


Virtex-II Pro device. High-speed interface 
functions using the RocketIO™ features 
of the Virtex-II Pro Platform FPGA are 
supported on the board with a variety of 





Kit does not include modules 
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(32 MB) 
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CF 


2 MB SRAM 
(PCI Memory Space) 





Figure 2 — Development board block diagram 


62 Xcell Journal 


Cypress 
CY7C1062AV33-12BGC. 


FLASH 
(16 MB) 


standard connectors. Xilinx IP cores pro- 
vide the logic functions required to 
implement the physical and logical layers 
of common serial communications stan- 
dards. A detailed block diagram of the 
board is shown in Figure 2. 

The development board has enough 
memory devices to support even a complex 
design. Memory support includes: 

¢ Micron DDR SDRAM SODIMM 
(128 MB expandable to 1 GB) 


¢ Micron Mobile SDRAM (two 8 Mb 
x 16 Mb devices, 32 MB total) 


¢ Cypress Asynchronous SRAM 
(512 Kb x 32 Kb, 2 MB total) 


ee a = 
A Single XPAK 
Rocket 1/O and a Combination of 
Two SFP and Two HSSDC2 
3. rs Connectors 







Virtex-ll Pro 
Xilinx 


SelectMAP. 


PCI Bridge 
Xilinx 
XC2S300E- 
6FG456C 


Voltage Clamp 
PCI Edge Connector 








¢ Intel StrataFlash (16 MB total) 
¢ CompactFlash card 


¢ Configuration memory using Xilinx 
PROM XC18V02-VQ44 and the 
Xilinx System ACE™ CF solution. 


With 1 GB of high-speed DDR SDRAM 
memory, bandwidth will not be a bottleneck 
for most 10 Gigabit Ethernet applications. 

The board supports a variety of high- 


speed interface standards. These include: 


¢ A standard XPAK module (located on 
the back of the board) that can imple- 
ment either a 10 Gigabit Ethernet port 
or an OC-192 serial interface 





Clocks 
pes 
Switches 
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DB9 
Right Angle 


PROM 
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¢ Two HSSDC2 connectors for imple- 
menting InfiniBand™ or Fibre 


Channel interfaces 


¢ Two small form-factor pluggable (SFP) 
modules for implementing Gigabit 
Ethernet, Fibre Channel, or InfiniBand 
protocols. 


These interfaces can be used in con- 
junction with RocketIO transceivers on 
the Virtex-II Pro device and the appropri- 
ate Xilinx IP cores to implement the 
desired serial interface standard. A bridg- 
ing or switch application could use one or 
more of these ports as the other end to a 
10 Gigabit Ethernet interface. 

Expansion of the basic features of the 
board is possible by using the PCI bus or the 
four 140-pin general purpose connectors 
(AvBus connectors). A variety of AvBus- 
compatible boards, from Avnet and. third 
parties, provide even more hardware options. 


A Speedy Start 

In addition to the extensive hardware fea- 
tures described above, the kit comes with a 
variety of supporting documentation and 
design files to make it easy to get started. 
The kit ships with a detailed user manual, 
quick start guide, and an extensive board 
support package that includes a full-featured 
Linux kernel, PCI bus, and memory 
drivers. Several demonstration examples 
show how to best use the features of the 
Virtex-II Pro device and interface it to 
additional devices on the development 
board. Two of the projects are PowerPC- 
based and give examples of memory inter- 
faces — one for an OPB interface and one 
for a PLB interface. 

Another example project is RocketIO- 
based and illustrates the use of a custom 
peripheral to allow PowerPC access to 
the multi-gigabit transceiver (MGT) at 
3.125 Gbps. 


Implementing 10 Gigbit Ethernet 

Several of the design considerations in the 
development kit will be common to any 
Virtex-II Pro-based design and are shared 
here. On our design, the MGTs of the 
Virtex-II Pro Platform FPGA are connect- 
ed to two HSSDC2 connectors, two SFP 
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ADS-XLX-V2PRO-DEVP7-5 (Populated with XC2VP7, —5 speed grade) 
ADS-XLX-V2PRO-DEVP7-6 (Populated with XC2VP7, —6 speed grade) 
ADS-XLX-V2PRO-DEVP20-5 (Populated with XC2VP20, —5 speed grade) 


ADS-XLX-V2PRO-DEVP20-6 (Populated with XC2VP20, —6 speed grade) 





$1,995 
$2,495 
$2,995 
$3,495 


Table 1 — Part numbers and prices of the Avnet Virtex-LI Pro development kit 


connectors with EMI cages, and pads for 
an XPAK host connector with mounting 
holes for the mid-board module holder. In 
the case of the two HSSDC2 and two SFP 
connections, the MGTs were treated indi- 
vidually. This means that the lengths of the 
transmit and receive signals were matched 
per MGT but not matched to any other 
MGT. However, in the case of the XPAK 
interface, four MGT channels were bond- 
ed together to create the 10 Gigabit 
Attachment Unit Interface (XAUI). The 
four transmit pairs and the four receive 
pairs have matched lengths. 

The XPAK is a SFP transceiver module 
for 10 Gbps serial data transmission. The 
XPAK interface on the board was designed 
to run at the IEEE 10GBASE-R optical 
rate of 10.3125 Gbps with a four-lane 
electrical interface at 3.125 Gbps (XAUI 
interface). This interface requires the —6 
speed grade Virtex-II Pro device and the 
use of the Xilinx 10 Gigabit Ethernet 
MAC core. The XPAK interface on the 
Virtex-II Pro development board complies 
with the XPAK MSA (multi-source agree- 
ment) Revision 2.1 (except for the pro- 
grammable supply, which is implemented 
via jumpers). The XPAK MSA closely 
resembles the XENPAK MSA and makes 
frequent references to it. The XPAK form 
factor was used because it is half the size of 
a XENPAK, does not require a large 
cutout in the PCB, and has a mid-board 
mounting option, allowing the module to 
be placed anywhere on the board instead 
of on the faceplate. 

The 156.25 MHz differential clock 
input is used for the reference clock to the 
MGT macro. Because the phase-locked 
loop of the MGT always multiplies by a 
factor of 20, using the 156.25 MHz clock 


results in a transmission rate of 3.125 
Gbps. The transmit and receive signals are 
directly connected to the XPAK host con- 
nector or are DC-coupled. However, all 
XPAK-compliant modules have AC cou- 
pling on both the transmit and receive sig- 
nals inside the module itself. The transmit 
differential pairs are routed on the solder 
side of the board, while the receive pairs are 
routed on the component side. This keeps 
the signals from crossing on the way to the 
XPAK connector. The analog ground 
planes in the layers directly adjacent to the 
outer layers provide the return paths. The 
analog ground planes are separated from 
the digital ground used for the rest of the 
components on the board, but are refer- 
enced to digital ground in several locations 
through ferrite beads. 

More design details like these are docu- 
mented in the user guide, design files, and 
example designs included with the devel- 
opment kit. 


Conclusion 

The Avnet Virtex-II Pro development kit 
is designed with real applications in mind. 
It has all of the features demanded by this 
class of application — a large FPGA, exten- 
sive memory, and standards-based flexible 
high-speed serial ports, as well as consider- 
able expansion capability. The kit should 
enable a faster time to market for your 
next 10 Gigabit or other high-speed 
embedded design. 

The Avnet Virtex-II Pro development 
kit is available now. See Table 1 for avail- 
able Virtex-Pro devices and board prices. 
Contact your local Avnet sales office to get 
detailed ordering information and to talk 
to an Avnet FAE about your specific 
design need. 3 
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Memec Design Simplities 
Bluetooth Development 


+ __ Anew development kit 
from Memec Design lets 
you implement Bluetooth 
wireless technology on 

| SpartandlEFRGAS with 


MicroBlaze soft processors: 
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by Jim Beneke 

Director of Technical Marketing 
Memec 
jim_beneke@mei.memec.com 


Dennis Schlaht 

Director of Technical Marketing 
Insight Memec 
dennis_schlaht@ins.memec.com 


After several false starts, Bluetooth™ short- 
haul wireless connectivity technology 
for embedded systems finally seems ready 
for prime time. Bluetooth technology is 
just now beginning to gain momentum, 
and we expect it to grow significantly in the 
coming years. 

For that reason, Insight Memec and 
Memec Design have developed an 
Embedded Bluetooth Development Kit fea- 
turing a Xilinx Spartan™-IIE FPGA cou- 
Xilinx MicroBlaze™ 


pled with a 


soft-processor core. 


Bluetooth Basics 

Bluetooth technology is a frequency hop- 
ping spread spectrum (FHSS) system that 
operates in radio frequencies in the 2.4 GHz 
to 2.5 GHz ISM band, and it has a maxi- 
mum data throughput of 723.2 Kbps. 

As with most standards, interoperability 
is key to success. The Bluetooth Special 
Interest Group’s goals include maintaining 
an open specification, delivering voice and 
data capability, and providing worldwide 
usability for short-range wireless solutions. 

The specification defines the protocols and 
profiles used by Bluetooth-certified products. 
Usage models define the real-world applica- 
tions, and these usage models result in profiles 
as defined in the Bluetooth specification. 

Profiles are basically instructions for 
implementing usage models. The profiles 
assure interoperability by providing a well- 
defined set of higher layer procedures and 
uniform ways of using the lower layers of 
the Bluetooth protocol. The serial port pro- 
file (SPP), for example, provides basic RS- 
232 serial cable emulation for Bluetooth 
devices. Legacy applications do not have to 
be modified to use Bluetooth technology; 
they can simply treat a Bluetooth link as a 


serial cable connection. 
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Figure 1 — Key elements of a typical Bluetooth application 


Adding Bluetooth capability requires 
the key elements shown in Figure 1. The 
RF radio and the baseband processor are 
typically 
Bluetooth transceiver solution. 


available as an integrated 
The transceiver chip provides a serial 
interface to the host processor via USB or 
UART ports. The host processor typically 
executes the upper protocol-specific func- 
tions defined by the Bluetooth standard. 
These protocol functions are known as 
the Bluetooth protocol stack and can be 
licensed in processor-specific binary or 
independent source code formats from 
stack providers, such as Stonestreet One. 


Integrated Solutions 

In a Windows™ or PocketPC-based envi- 
ronment, a Penttum™ or ARM processor 
actually executes the upper stack functions, 
which are usually included as part of some 
Bluetooth software applications. 

In an embedded application, the 
Bluetooth upper stack and host processor 
function must be accounted for in some 
other way, typically with an embedded core. 

The host 


requirements for 


processor performance 
supporting the 
Bluetooth protocol are minimal, usually 
less than 1 MIPS. Thus, it is very easy to 


add Bluetooth support to an embedded 


Optional 
RS-232 


Windows PG 
HyperTerminal 








application if the host 
processor has processing 
bandwidth and a Bluetooth 
stack is available for the tar- 
geted processor. 

The 150 D-MIPS per- 
formance of the MicroBlaze 
soft-processor core means 
the additional 1 MIPS of processing 
overhead can often be accommodated 
with relative ease. 

The availability of the Bluetooth proto- 
col stack presents a bigger obstacle, because 
the stack must be ported to the targeted 
embedded processor. However, with the 
introduction of the Memec Design 
Embedded Bluetooth Development Kit, 
this issue has been addressed for any 
MicroBlaze-based system. 


The Embedded Bluetooth Kit 

The Memec Design kit provides all the 
necessary components to build and test a 
Bluetooth system: 


¢ An evaluation version of the Bluetooth 
stack ported to the MicroBlaze core 


¢ A Spartan-IIE development board 
¢ Two P160 Bluetooth modules 


e An evaluation version of the 
BTExplorer™ Windows application 


¢ Xilinx EDK software 
* Cables 
¢ Power supplies 


¢ Documentation. 


Figure 2 shows a typical setup of the 
Bluetooth kit. 





Windows PC 
BTExplorer 


Figure 2 — Bluetooth Development Kit with wireless link 
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A simple point-to-point radio system 
can be created with the kit components 
and a Windows-based PC. One end of the 
communication system is created with a 
standalone P160 Bluetooth module con- 
nected via USB or RS232 to a Windows 
PC. This PC runs the evaluation version of 
the BTExplorer application, which exe- 
cutes the Bluetooth stack within the 
Windows environment. 

The other radio system uses the 
Spartan-HE board and second P160 
Bluetooth module. The Spartan-IIE plat- 
form implements a MicroBlaze design, 
combining the evaluation Bluetooth binary 
stack and serial port profile with your 
application code. An optional second PC 
connects to the Spartan-IIE board for user 
feedback and control. 

Figure 3 shows the functionality of the 
P160 Bluetooth module based on 
Broadcom’s BCM2035 — a monolithic, sin- 
gle-chip, baseband processor with an inte- 
grated 2.4 GHz transceiver for Bluetooth 
vl.1 and 1.2 applications. It minimizes the 
footprint and system cost of implementing 
a Bluetooth link by integrating all critical 
components into the device. 

The BCM2035 is an ideal solution for 
any voice or data application that requires 
the Bluetooth standard Host Controller 
Interface (HCI) via either UART or USB, 
and PCM (pulse code modulation) audio 
interfaces. The integrated microprocessor 
unit stores the lower level protocol stack in 
ROM plus patch RAM to provide the maxi- 
mum flexibility while eliminating the need 
for external flash memory for the lower stack. 

The BCM2035 radio transceiver pro- 
vides enhanced radio performance to meet 
the most stringent industrial temperature 
applications or the tightest integration into 
portable devices. It provides the highest 
available radio performance of any single- 
chip device with -90 dBm sensitivity and 
+7 dBm programmable output power. 


Development System Capabilities 

The Spartan-IIE board from Memec 
Design provides all the necessary features 
for implementing a simple MicroBlaze 
design. The board includes the P160 expan- 


sion module slot for connection to the 
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Figure 3 — P160 Bluetooth Module functional diagram 
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Figure 4 — Typical MicroBlaze-based system 


P160 Bluetooth module, SDRAM, serial 
port, and miscellaneous support circuits. 
Figure 4 shows an example MicroBlaze 
system that incorporates the Bluetooth 
interface. Because the Bluetooth module 
looks like a standard serial port to the 
MicroBlaze system, the only hardware 
modification required to the system archi- 
tecture is the addition of a UART block. 
The Memec Embedded 


Bluetooth Development Kit includes a fully 


Design 


functional evaluation version of Stonestreet 
One's Bluetopia™ protocol stack. An 
implementation of the upper Bluetooth 
protocol stack, the Bluetopia software eases 
application development by providing a 
robust yet easy-to-use development tool 
that implements the Bluetooth protocols 
above the HCI. 

The Bluetopia application program- 
ming interface (API) provides access to the 
upper-layer protocols, including Logical 
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Host Controller Interface 


Figure 5 — Bluetooth protocol stack 


Link and Adaptation Protocol (L2CAP), 
Service Discovery Protocol (SDP), 
RFCOMM protocol, and OBEX protocol. 
Bluetopia software also provides APIs for 
all mandatory and most optional profiles. 

In addition, Stonestreet One provides 
lower level HCI transport drivers to inter- 
face to various Bluetooth devices such as 
the BCM2035. These lower level HCI 
drivers have been written to support the 
addition of new HCI transport drivers with 
little or no change to existing applications. 

The evaluation version of the 
Bluetopia protocol stack provides support 
for the following Bluetooth protocols: 
HCI, L2CAP, SDP, and RFCOMM. The 
Bluetopia stack also provides support for 
generic access profile (GAP) and serial 
port profile (SPP) Bluetooth profiles. 
Figure 5 illustrates an example Bluetooth 
protocol stack. 

In addition to the above listed protocols 
and. profiles, Stonestreet One will make 
available an extensive suite of existing and 
future Bluetooth protocols/profiles as 
required. Full development or production 
versions are available through license from 
Stonestreet One along with additional 
profiles if required. 

BT Explorer is a user-friendly Windows 
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application that is used to connect and 
manage Bluetooth devices. BTExplorer dis- 
covers Bluetooth devices in the vicinity and 
presents them in an easy-to-understand 
format similar to Windows Explorer. 

As used in the development kit, 
BTExplorer facilitates a simple peer-to- 
peer system. When connected to the 
standalone P160 Bluetooth Module, 
BTExplorer provides the upper stack capa- 
bility to the module, thus creating a com- 
plete Bluetooth node. 


Conclusion 

The Memec Design Embedded Bluetooth 
Development Kit allows designers to 
enhance any new or existing MicroBlaze- 
based designs with the addition of a 
Bluetooth interface. By providing a 
MicroBlaze-specific port of the Bluetooth 
stack along with the necessary prototyping 
hardware, the kit eases the development 
process, shrinks the design cycle, and 
speeds your time to market. 

The Embedded Bluetooth Kit is avail- 
able for $1,495 from Insight Memec at 
www.memec.com/insight-memec/. Plans 
are under way to develop a similar devel- 
opment kit utilizing the new Xilinx 
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* in the Xcell Journal! 
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Save Time, Money, and Space 


Xilinx leads the industry in 300 mm waters and 90 nm process technologies. 


Xcell Journal 





by Steve Sharp 

Senior Manager, Programmable Logic Solutions Marketing 
Xilinx, Inc. 

steve.sharp@xilinx.com 


Xilinx is aggressively driving down costs 
for its customers, especially in the area of 
90 nm process technology and 300 mm 
wafer manufacturing. 

The industry leader in PLD (program- 
mable logic device) process technology 
since the introduction of its VirtexTM-E 
and Virtex-EM FPGA families in 1999, 
Xilinx was already well ahead of the 
Semiconductor Industry Association’s 
process roadmap. And we've maintained 
that lead by more than a year ever since. 

Our successful strategy includes part- 
nerships with two world-class fabrication 
partners: IBM™ in the United States and 
UMC™ in Taiwan. Both companies are 
experts in process technology and manu- 
facturing. They have allowed Xilinx to be 
the first PLD manufacturer to deliver prod- 
ucts based on 300 mm wafers and, more 
recently, 90 nm process technology. 


Advantages Beyond Wafer-Thin 
Larger 300 mm wafers are imperative for 
lowering costs. With almost twice the usable 
wafer area and approximately 2.5 times the 
number of die per wafer, cost savings of 30% 
to 40% over 200 mm wafers are typical. 

According to Glen Yeung of the invest- 
ment firm Salomon Smith Barney, “[These] 
300 mm facilities are the wave of the future. 
For both chip and equipment companies, 
the move is an economic imperative.” 

The semiconductor industry has been 
successful in the use of “copy exact,” which 
makes it effective to transition a product 
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from one fabrication line to another. 
However, when moving from 200 mm 
wafers to 300 mm wafers, copy exact does 
not exist. This leaves the door open for 
process and equipment variabilities that 
can affect yield. 

“We would expect a significant invest- 
ment to convert an existing product from 
200 mm to 300 mm wafers,” said Jim 
Feldhan, president of Semico Research Corp. 


Re-Qualification Headaches 

Designers must always consider a re-quali- 
fication when a device moves from 200 
mm to 300 mm wafer manufacturing. 


These re-qualification efforts can include: 


¢ Review of the process 
change notice (PCN) by 
the designer 


e Assembling test boards 


¢ Re-characterization 


of boards 


¢ Full systems review, 
including evaluation 
of thermal stress and 
high/low voltage testing. 


Re-qualification can take as 
long as three to nine months 
and consume valuable engi- 


neering resources. 


Xilinx: At the Cutting Edge 

Xilinx moved established products, such as 
the Spartan™-IIE and Virtex-I] FPGA 
families, to 300 mm wafers last year. Xilinx 
has also chosen to release its Virtex-II 
Pro™ and Spartan-3 FPGA families 
directly on 300 mm wafers, saving cus- 
tomers from performing a system re-quali- 
fication later on. 

We realized that to deliver the maxi- 
mum cost benefit to our customers, it 
would be necessary to avoid requiring com- 
panies to re-qualify a product in their sys- 
tems when moving from 200 mm to 300 
mm wafers. Therefore, we have done the 
re-qualification for them. 


Cheaper Solutions in 90 nm 
The use of the latest 90 nm process tech- 
nology can also have a profound impact on 
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cost. With half the die size for an equiva- 
lent circuit on 130 nm technology, 90 nm 
technology is rapidly becoming the choice 
of industry leaders as they strive to reduce 
costs for their customers. 

As Dan Hutcheson of VLSI Research Inc 
points out, “Companies that get into 90 nm 
production first will get a tremendous advan- 
tage in lower cost. Rivals who are late in 
[adopting] 90 nm process technology will 
fall behind and may not be able to catch up.” 

When the die size advantage of 90 nm 
process technology is combined with the 
manufacturing efficiency of 300 mm wafers, 
the savings of a combined 90 nm/300 mm 
solution is a 5X increase in gross die per 





wafer when compared to 130 nm technolo- 
gy on 200 mm wafers. 

These cost savings — the largest in 
recent semiconductor history — could 
result in pricing of under $20 for one 
million system gates in 2004, which 
would be a seven-fold cost decrease from 
the year 2000. 


Conclusion 

Not all companies have the resources to 
develop such advanced technology, but as 
leaders in their respective industries, 
Intel™, IBM™, Texas Instruments™, 
and Xilinx are all adopting — and adapting 
to — the latest 90 nm/300 mm process 
technologies. Our customers will be the 
first to realize the tremendous cost benefits 
this new technology can deliver. & 


Xilinx Events 
and Tradeshows 


Xilinx participates in numerous trade 


shows and events throughout the year. 


This is a perfect opportunity to meet 
our silicon and software experts, ask 
questions, see demonstrations of new 
products and technologies, and 
hear other customers’ success stories 
with Xilinx products. 


For more information and the most 
up-to-date schedule, visit 


www.xilinx.com/events/. 


Worldwide Events Schedule 


September 9-11 

Military and Aerospace Programmable Logic 
Devices (MAPLD) International Conference 
Washington, DC 


September 15-18 
Intel Developer Forum 
San Jose, CA 


September 16-17 
Embedded Systems Conference 
Boston, MA 


November 17-19 

Software Defined Radio Technical 
Conference and Product Exposition 
Orlando, FL 


March 8-10, 2004 
Wireless Systems Design 
San Diego, CA 


March 30-April 1, 2004 
electronicaUSA with Embedded 
Systems Conference 

San Francisco, CA 
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Prototyping Solution 


mons AWESOME ASIC SO 
. . 


Cool Emulation Platforms 
DN3000K10 DN6000K10S 


4 Logic Emulation PWB with the Xilinx Single Virtexll - Pro PWB 





VirtexlI™ FPGA's. - 2VP70/2VP100 
- Up to 5 2V6000/2V8000's per PWB - 4 Separate DDR SDRAM's (16Mb x 16) 
-- 466,000 FF's! - 4 Separate SSRAM's (512k x 36) 
- 2 PLL's for high speed clock distribution - 2 Flash's (4M x 16) 
- 4 512k x 36 SSRAM's - Misc. serial I/O's 
- 1 GB SDRAM DIMM (PC133) - 2 PowerPC's 
- Monstrous amounts of interconnect - PCI / PCI-X or standalone 





- FPGA configuration using SmartMedia Card 





DN3000K10S 
Single VirtexII™ PWB 
- 2V4000/6000/8000 
- 2 PLL's for high speed clock distribution 
-3512k x 36 SSRAM's 
- 1 GB SDRAM DIMM (PC133) 
- FPGA configuration using SmartMedia Card 
- Sun, Win2000/NT/98, LINUX Drivers 
and Utilities 














pavninnnk{ unin i 








A Sample of Our Services: PCI Extender 


64-bit/66MHz Active Extender 


- ASIC & FPGA Design Translation - FPGA/ASIC/Board Design - Intel® 21154 PCl-to-PCI bridge 

i i = ] - Active PCI Extender with THREE 32/64-bit slots 
- HW Emulation and Testing Turnkey VerilogVHDL - Easy access to all PCI signals 
2 Algorithmic Acceleration Design and Verification - 5V -> 3.3V or 3.3V -> 5V signal translation 


- Frequency division of PCI Clock 
- Numerous LEDs show system status 
- +3.3V not needed on host 


Mr. Freaky Says: Mr. Lazy Says: 

"This stuff is WAY "| hate searching for 

cool!" bugs; these guys 
made my life easy!" 














- White-Backed 
Mousebird - Blue Naped Mousebird 


The DINI Group The 


1010 Pearl Street, Suite 6 7 
La Jolla, CA 92037-5165 y | 
http://www.dinigroup.com 


Voice: (858) 454-3419 


FAX: (858) 454-1728 G 4 
Email: sales@dinigroup.com rOoup 
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Take the Titanium Solution 


Xilinx Titanium Technical Service engineers coach clients through design issues. 


by Vikram Pasham 
Design Engineer, Xilinx Design Services 
Xilinx, Inc. 
vikram.pasham@xilinx.com 


Need to enhance your design productivity? 
Decrease your design costs? Accelerate time 
to market? With the complexities and pos- 
sibilities of today’s designs, it’s easy to get 
sidetracked from your main goals. 

Xilinx Titanium Technical Service pro- 
vides on-site or off-site support for clients 
on a contract basis. Titanium application 
engineers are adept at ensuring that you 
start and finish your designs the right way. 
Our engineers provide design methodology 
coaching to make sure you take the most 
efficient approach. And our engineers’ skill 
at tracing debug issues back to the design is 
one of our most powerful services. Here’s 
an example from one of our clients. 


Customer Challenges 

During the development of a high-definition 
television (HDTV) video application, the 
engineering team of our client company 
(which prefers to remain anonymous for 
competitive reasons) had developed propri- 
etary DSP algorithms for uniform video 
correction. These were implemented on a 
custom ASIC developed by a third party, 
which had several silicon bugs. 

Fearing that the bugs would cost them 
the chance to be first to market with their 
technology, the engineers ported part of the 
ASIC’s functionality to a Xilinx FPGA. Yet 
they still noticed that prototypes, when 
tested in the lab, had power-up and image 
distortion issues. Thus, they decided to 
bring in a dedicated Xilinx Titanium 
Technical Service application engineer. 
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The Titanium Solution 
Over the course of one week, a Xilinx 


Titanium engineer gave the client's engi- 
neering team a crash course on FPGA 
design techniques and constraints. He sug- 
gested replacing the existing design’s clock- 
ing structure with Virtex-II digital clock 
managers (DCMs) for multiple clock gen- 
erations. After learning about the features of 
DCMs, the engineering team discovered 
they could use DCM status bits to deter- 
mine if the digital video interface (DVI) 
clock was disconnected. Based on this dis- 
covery, the team designed a robust recovery 
mechanism. This solved the power-up and 
image distortion problems. 

The existing design was already using all 
of the resources of the FPGA, yet there were 
further requirements to add functionality. 
The Titanium engineer identified portions 





of the design that weren't suited for FPGA 
architecture. He recommended other changes 
for optimal efficiency, while still preserving 
design functionality. Knowing that Virtex-II 
silicon can easily meet 60 MHz DVI clock 
rates, he suggested using tight packing 
options in map tools, and using the multipass 
place-and-route and floorplanning features 
in Xilinx ISE tools. This freed up enough 
resources for the engineers to add the addi- 
tional features in the FPGA. 

To meet timing in the Virtex-II —4 speed 
grade part for the six different pin configura- 
tions, the Titanium engineer placed timing 
constraints to cover all the clocks, cross clock 
domain paths, and multicycle paths. 
Together, they put together working proto- 
types in the lab supporting the six different 
pin configurations. 

After these adjustments to the design, 
the company is now able to use a lower- 
speed grade part, reducing their system cost. 
They are on schedule to meet their time-to- 
market goals. 


Conclusion 
A Titanium Technical Service application 
engineer can work at Xilinx, on-site, or a 
mix of both. This flexibility allows the 
engineers to fully understand the needs and 
requirements of our clients, as well as lever- 
age Xilinx factory resources to resolve prob- 
lems and accelerate production. 

For more information about Titanium 
Technical Services, please call 800-888- 
FPGA (3742) or visit Attp://support. 


xilinx.com/support/services/titanium.htm. & 
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Maximize Your Success 
with Spartan-3 Designs 


Xilinx Global Services helps you get the most 


by Bill Okubo 

Solutions Marketing Manager, Global Services 
Xilinx, Inc. 

bill.okubo@xilinx.com 


Increasingly, FPGAs are being used as sys- 
tems platforms because they are a more 
strategic purchase for customers who 
require a comprehensive solution that 
includes software, silicon, intellectual prop- 
erty (IP), and specialized services. 

At the same time, FPGAs are expanding 
into new application areas such as digital 
signal processing (DSP), automotive, med- 
ical imaging, and video broadcast, where 
engineers new to FPGA technology need to 
learn quickly. Xilinx Global Services 
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enables customers to reduce the learning 
curve and development costs associated 
with creating programmable system solu- 
tions using the Spartan™-3 family. 


Xilinx Global Services Portfolio 

Xilinx 90 nm Spartan-3 Platform FPGAs 
are reshaping the semiconductor land- 
scape, and “on-demand” application engi- 
neers, global support centers, and 
personalized online support are some 
examples of how Xilinx is expanding its 
services to meet this evolving market 
trend. For Spartan-3 designs, the portfolio 
of support and services to meet these 
advanced requirements includes Education 


Services, Support and Titanium Technical 





Service, Xilinx Design Services, and the 
Xilinx Productivity Advantage Program. 


Education Services 
On any given business day, as many as six 
Xilinx Education Services classes may be 
taking place simultaneously at any number 
of cities around the world. Xilinx Education 
Services has new locations that expand our 
presence to 50 locations in North America, 
Europe, and Asia Pacific. In addition to 
public classes at Xilinx training facilities and 
regional training centers, we also offer pri- 
vate classes that can be taught either at your 
location or a nearby facility. 

For Spartan-3 and ASIC designers, Xilinx 
offers several new and free technical lecture 
modules to help get you up to speed quickly: 


¢ Overview of the Spartan-3 Family — 
Learn about the Spartan-3 architecture, 
its underlying technology and target 
markets, and software support features. 


e ASIC User to FPGA Design — Choose 
from three modules that include a 
technology comparison between 
FPGAs and ASICs, design flow differ- 
ences, and how to optimize ASIC code 
for implementation in Xilinx FPGAs. 


For an in-depth curriculum on design 
methodology for the Spartan-3 family, 
Education Services has recently updated 
several key courses, including: 


¢ Fundamentals of FPGA Design 


¢ Designing for Performance for the 


FPGA User 


¢ Designing for Performance for the 


ASIC User 
e Advanced FPGA Design. 


To help reduce your overall develop- 
ment and education costs, FPGA Essentials 
is a new promotional package that delivers 
both the Fundamentals of FPGA Design 
and the Designing for Performance courses 
at a savings of $200. 

For course schedules or more informa- 
tion about Education Services for Spartan-3 
FPGAs, visit www.xilinx.com/education and 
select the region where you would like to 
take a course. 
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Support and Titanium Technical Service 
Support Tools and MySupport.xilinx.com 
Support for software and IP tools can dra- 
matically improve the success of your 
design projects. Support.xilinx.com reduces 
downtime with 24/7 online support tools 
such as: 
¢ Answer Browser — Browse for informa- 
tion using the Answer Browser or use 
the powerful Advanced Search to pin- 
point your exact search location by cat- 
egory and product name. 


¢ Tech Tips — Learn tips and 
techniques from the experts 
through access to a categorical 
listing of tools, reference mate- 


rials, and other information. 


¢ Forums — A new Spartan-3 
forum lets you solve design 
issues by collaborating online 
with other designers. 


¢ Problem Solvers — Find 
answers to your design ques- 
tions in a logical manner with 
this interactive tool that can 
save you hours of work. 


For personalized support on 
the Spartan-3 family, you can 
enroll in MySupport.xilinx.com to 
receive automatic alerts on specific infor- 
mation you choose, such as updated 
information on data sheets, product 
change notices, and product discontinua- 
tion notices. 

Visit support.xilinx.com to find out more 
about MySupport and the support tools 
available for Spartan-3 designs. 


Titanium Technical Service 

Titanium Technical Service provides a 
dedicated applications engineer who will 
supply expert advice on the best strategies 
to achieve timing closure and design fit in 
the shortest amount of time. The 
Titanium team of engineers has extensive 
knowledge of the new Spartan-3 architec- 
ture, 90 nm technology, and design tech- 
niques that will help you achieve your 
design goals for developing a low-cost 
FPGA. Titanium engineers are available 
on a contract basis and can work remotely 
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or at your site. For more information 
about Titanium Technical Service, visit 
support.xilinx.com/titanium/. 


Xilinx Design Services 
Xilinx Design Services (XDS) combines 
skills and experience in system, logic, and 
embedded software designs to provide a 
unique development partner. Leveraging 
these skill sets will help you optimize your 
budget, schedule, 


and performance 


requirements for your project. 





FPGA Design from Specification 

XDS will develop designs from specifica- 
tion for projects such as ASIC conversions 
or driver development and _ integration. 
XDS has the expertise in optimizing Xilinx 
technology to provide the best solution, 
and excellent project management to 


ensure on-time and correct deliveries. 


FPGA System Design 

XDS has broad applications expertise — 
including an immense depth of experience 
with Xilinx embedded processing tools and 
products — for system architecture consult- 
ing, FPGA logic, and embedded software 
design projects. 


Embedded Software Design 

XDS also has tremendous experience with 
FPGA platform design, including proces- 
sors and gates, MicroBlaze™ soft proces- 
sors, and hardware/software co-design 


techniques. Projects such as complex 
embedded software designs with real-time 
constraints, driver development, and inte- 
gration with hardware are areas where 
XDS can help. 

For more information about Xilinx 


Design Services, visit www.xilinx.com/xds/. 


The XPA Program 

To help you use Spartan-3 FPGAs to their 
maximum potential, we offer everything 
you need — software, education and. sup- 
port services, and IP cores — deliv- 
ered in one comprehensive 
package called the Xilinx 
Productivity Advantage (XPA) 
Program. It allows you to get the 
tools and services you need, when 
you need them, without the worry 
of ordering each piece of the solu- 
tion separately. 

The XPA Program helps to 
reduce the overall cost of develop- 
ing your designs by bundling 
Xilinx tools and services at the best 
value. Because the XPA Program is 
set up with a single purchase order, 
it cuts down on the paperwork 
required to equip each designer 
and saves valuable time. Our pack- 
aged solution ensures that your 
engineering team will have everything they 
need to begin designing immediately. For 
more information about the XPA Program, 
visit support.xilinx.com/xpal/. 


Conclusion 

Xilinx Global Services is committed to 
helping you succeed with Spartan-3 
devices, the world’s lowest cost FPGA 
family. Our complete portfolio of support 
and services provides you with ready 
access to the knowledge and expertise 
that you need. By taking advantage of 
Xilinx Global Services, you will learn 
faster and be able to get better perform- 
ance from your Spartan-3 designs. For 
more information about Xilinx Global 
Services, visit support.xilinx.com, where 
you can learn about our support tools and 
MySupport, or select the “Education” or 
“Services” menus for more information 


on those topics. & 
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It’s a Programmable World 


Programmable World 2003 workshops ofter detailed instruction in tools, technologies, 
and methodologies of system designs. 


by David Vornholt 
AllianceEDA Program Manager 
Xilinx, Inc. 

david. vornholt@xilinx.com 


Why spend your evenings and 
weekends wading through product 
datasheets and application notes? 
Time to knowledge is just as 
important as time to market. 

In the international series of 
forums and workshops that make up 
Programmable World 2003, Xilinx 
and its partners have created dozens of 
free technical seminars designed to stir 
your imagination, accelerate your design 
cycle, and reduce the overall design cost. 


An Ecosystem of Leaders 

The driving concept __ behind 
Programmable World 2003 has been the 
recognition that we do not work in a vac- 
uum. Recognizing this, Xilinx has part- 
nered with such industry leaders as 
IBM", Agilent Technologies™, 
Cadence™ Design Systems, Wind 
River™ Systems, and Mentor Graphics®. 
The synergy of Xilinx and our partners 
plays an inherent role in the success of 
your designs. 

The definition and implementation of 
your design involves support from not 
just your IC supplier but also from your 
EDA, embedded tool, IP, and design serv- 
ice partners. This is our “ecosystem” of 
companies that want to support your 


Xilinx-based design. 
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Cost-Sensitive Designs 
The accessibility and usability of cutting- 
edge technologies will be demonstrated 
at Programmable World. 

For example, the Xilinx Spartan™-3 
family is based on IBM and UMC’s 
advanced 90 nm, 8-layer metal process 


technology. Xilinx is using 90 nm tech- 
nology to drive high-volume pricing 
down to under $20 per unit by the end of 
2004 for one-million-gate FPGAs. This 
represents a cost savings of up to 80 per- 
cent compared to competitive offerings. 


Programmable World Forums 

The Programmable World forums, which 
were held earlier this year, comprised 
high-level discussions and demonstra- 





tions from Xilinx and the partners men- 
tioned above. Other forum participants 
were Boeing™, Intel™, and ‘Texas 
Instruments™, among others. 
Attendees included hardware engi- 
neers, ASIC designers, system engineers, 
system architects, executives, and 
researchers from more than 27 indus- 
tries and government agencies, ranging 
from home networking to the military. 
In the forums, panelists and presen- 
ters explored the end-use possibilities 
and real-world design challenges of using 
multi-gigabit I/Os, embedded processors, 
low-cost platforms, and DSP solutions: 


= IBM discussed the design methods 
associated with the integration of new 
processor architectures in FPGA-based. 
systems. 


m@ Cadence Design Systems and Xilinx 
described the challenges — and solu- 
tions — when upgrading existing sys- 


tems for higher bandwidth. 


& Agilent Technologies addressed the 
growing difficulties of system verifica- 
tion and hardware/software debugging. 


& Xilinx presented strategies for solving 
the connectivity challenges of integrat- 
ing ASSPs, network processors, and 
memory devices. 


@ Xilinx described the use of processor 
cores to combine the strengths of both 
the processor and logic for optimum 
performance and cost. 


@ Xilinx examined technical trends in 
DSP and presented a case for new 
architectures and system-level design 
methodologies. 


@ Xilinx also demonstrated PCI 
Express™, RocketPHY™, 
MicroBlaze™/Spartan-3, and DSP 
system products. Agilent, Cadence, 
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Celoxica, IBM, Mentor Graphics, 
Nallatech, Synplicity®, and The 
MathWorks demonstrated their 
respective technologies. 


@ In a panel discussion entitled 
“What’s Required to Make High- 
Speed Serial Design a Reality,” pan- 
elists from Agilent, Cadence, IBM, 
Intel, Mentor Graphics, Texas 
Instruments, and Xilinx confirmed 
the impending proliferation of serial 
technology and systems. The pan- 
elists also highlighted the technolo- 
gies they're creating in anticipation of 


high-speed serial design demands. 


User Applications 

The Programmable World forums also 
included presentations on actual program- 
mable system designs: 


m Christopher Musial, a technical fel- 
low at Boeing-SVS, gave a presenta- 
tion entitled “Programmable 
Systems Transform the Design and 
Grote) Coutonte-teCost Me mace Teh yom Oy oi8(c) 
Systems.” This case study illustrat- 
ed the design processes and issues 
related to the implementation of a 
Xilinx Platform FPGA in a real- 
time adaptive optics control system. 
As Musial explained, the project 
had stringent constraints for cost, 
performance, power, and maintain- 
ability — parameters challenging 
many designers today. 


m “A Highly Scalable FPGA-Based Car 
Infotainment Platform Solution” 
was the topic of Dr. Karlheinz 
Weiss’ presentation. Dr. Weiss, the 
director of electronic system devel- 
opment at Elektroniksystem und 
Logistik GmbH, described the plan- 
ning, specification, and implementa- 
tion phases of an automobile 
infotainment system using Virtex-II 
Pro™ and Spartan FPGAs for the 
processing platform. 


This design integrated audio, naviga- 
tion, telephone, Internet services, telem- 
atics services, and video. 
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For those of you who missed the forums 
component of Programmable World — as 
well as those who wish to relive the experi- 
ence — many of the presentations are view- 
able through the video-on-demand feature 
at www.xilinx.com/pw2003. 


Free “University Style” Workshops 
Xilinx has also developed a second com- 
ponent to the Programmable World 
forums — a series of workshops from 
September to November 2003. After 
learning about the high-level concepts 
discussed at the forums, you now have 
the opportunity to drill down to the 
details of creating your particular design. 

The workshops are organized into four 
major “tracks,” each comprising six one- 
hour sessions. You can attend any of the 24 
sessions. 


@ Track A: Designing Flexible and 
High-Performance DSP Systems. 
When does it make sense to use DSP 
processors or platform FPGAs individ- 
ually, and when does it make sense to 
combine the unique signal processing 
capabilities of both? Example scenarios 
help illustrate how to best answer 
these design dilemmas. Also, industry- 
leading DSP solution providers 
including The MathWorks, Nallatech, 
Texas Instruments, and Xilinx explain 
how they can help designers outsmart 
Moore’s law and dramatically reduce 
the development time and cost of 
high-performance DSP systems. 


§ Track B: Integrating High- 
Performance Embedded Processing 
Solutions. This track focuses on 
technologies of interest to embedded 
systems designers from companies 
such as Altium™, IBM, Wind River 
Systems, and Xilinx. 


@ Track C: Implementing Cost- 
Effective Connectivity Solutions. 
Seeking low-cost connectivity applica- 
tions? Cadence, Intel, and Xilinx pres- 
ent examples of gigabit Ethernet 
applications; implementation of high- 


speed parallel and multi-gigabit serial 
interfaces for chip-to-chip and back- 
plane communications; and designing 
with serial technology. 


@ Track D: Effective System Design for 
Programmable Systems. FPGAs pro- 
vide a variety of flexible and powerful 
capabilities and have evolved into true 
system platforms. This track offers 
engineers new and innovative infor- 
mation from Agilent, Celoxica, 
Mentor Graphics, Synopsys™, 
Synplicity, and Xilinx on how to 
extract the most performance from 
your system designs. 


Locations and Dates 


The schedule for Programmable World 2003 
workshops this year is: 


September 2003 
San Jose, CA: 
Woodland Hills, CA: 
San Diego, CA: 
Stuttgart, Germany: 
Munich, Germany: 
Stockholm, Sweden: 
Helsinki, Finland: 
Madrid, Spain: 
Paris, France: 


September 3 

September 4 

September 5 

September 9 

September 10 
September 11 
September 12 
September 15 
September 16 
September 17 
September 18 
September 19 
September 22 
September 24 
September 25 
September 26 


Eindhoven, Netherlands: 
Leuven, Belgium: 
Boston, MA: 

Ottawa, Canada: 
Toronto, Canada: 
Dallas, TX: 

Chicago, IL: 


October 2003 
Shanghai, China: 
Hsinchu, Taiwan: 
Seoul, Korea: 


November 2003 
Tokyo, Japan: 


October 20 
October 22 
October 24 


November 14 





Conclusion 

The Programmable World 2003 series 
of tradeshows, forums, and workshops 
can solve your  time-to-knowledge 
headaches by giving you access to critical 
information from industry leaders in inter- 
national venues. 

The topics are focused on providing cost- 
effective solutions to meet your time-to-mar- 
ket and performance goals. 

For more information or to register for 
workshops, visit www.xilinx.com/pw2003/ 
workshop/. 

Like the forum presentations, the first 
Programmable World workshop in San Jose 
will be available in the last quarter of 2003 as 


a video-on-demand. 
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DSP & FPGA Module 
- 64 bits 6(00MHz TMS320C6416 DSP 
- Xilinx Virtex Il FPGA XC2V6000 
- 256MB of SDRAM @100MHz and 8MB of FLASH 
- Six 400Mbytes/s Sundance High-speed Bus 
- Six ComPorts for InterDSP communications 





DSP & FPGA Module 
- Two 225MHz TMS320C6713 Floating Point DSP 
- Xilinx Virtex Il FPGA XC2V2000-4 
- 256MB of SDRAM @100MHz and 8MB of FLASH 
- Six ComPorts and two 400MB/s SHB ports 





FPGA Module 
- Virtex Il range of FPGA devices 
- Up to 8 million gates as a re-configurable resource 
- 4MB of high speed ZBT SRAM, 
- 2MB of high speed QDR-SRAM, 
- External Clock & LVDS Interface 
- Multiple FPGA/DSP support 





FPGA PMC Module 
- Virtex-Il FPGA from V4000 to V8000 
- 128MB of 133Mhz SDRAM, accessible from FPGA 
and PCI bus 
- 16MB, 200MHz DDR SRAM or QDR SRAM 
- 32MB, 250MHz DDR SDRAM and 16MB of Flash 
- 64 bits, 66MHz PCI interface 
- 4 programmable clocks generating up to 500MHz 
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Xilinx Alliance EDA 
Members and Products 


Alatek 


www.alatek.com 
Hardware acceleration (HES) 


Altium 
www.altium.com/products/nvisage.htm 
Complete PCB/FPGA design flow 
(nVisage) targeting Spartan™ Series FPGAs 


Apache Design 
www.apache-da.com/Products/nspice.htm 
HSPICE®-compatible simulator using 
actual S-parameter data (NSPICE) for 
Virtex-II Pro™ MGT-based PCB design 


Aplus 

www.aplus.com 

Architecture-specific synthesis and layout 
optimization for Virtex™ II series FPGA 


(PALACE™) 


Aptix 
www.aptix.com 
ASIC emulation (Prototype Studio™) 


Atrenta 

www.atrenta.com 

RTL Linter technology for Virtex-II series 
FPGAs (SpyGlass®) 


Cadence Design Systems 
www.cadence.com/featurelfpga_design.html 
Virtex-II Pro MGT support for high-speed 
PCB design (SPECCTRAQuest®); NCSim 
family for FPGA design simulation; SPW 
for DSP data path development on FPGAs 
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Celoxica 

www.celoxica.com 

C-level design creation, compilation, and 
verification for Virtex-II and Virtex-II Pro 


FPGAs 


Endeavor 
www.endeav.com 
Co-verification for Virtex-II Pro 


(CoSimple™) 


EVE 
www.eve-team.com 
RT L-level Simulator Accelerator (ZEBU) 


Forte Design 

www. forteds.com 

HLL compiler to RTL (Cynlib Tool Suite); 
timing specification and analysis tool 
(TimingDesigner) 


Future Design Automation 
www.future-da.com 

ANSI-C algorithms to RTL models 
(SystemCenter) 


Hierarchical Design Inc. (HDI) 

www. hierdesign.com 

Hierarchical floorplanning and analysis 
software for Virtex-II series FPGAs 


Mentor Graphics 

www.mentor.com 

Virtex-II Pro MGT support for high-speed 
PCB design (HyperLynx™, ICX™, 
Tau™); complete FPGA design flow using 
the ModelSim® family of simulators and 
Leonardo-Spectrum™/Precision® synthesis; 
Seamless® co-verification support for 
Virtex-II Pro 


Novilit 

www.novilit.com 

Design partitioning using ISE EDK 
(AnyWare) for Virtex-II series FPGAs 








Product Acceleration Inc. 

www.prodacc.com 

FPGA symbol generation 
(LiveComponent™); automated pin 
assignment optimization; PCB layer count 
control (DesignF/X) 


Simucad 
www.simucad.com 
Verilog™ simulator for FPGAs (Silos) 


Synopsys 

Wwww.synopsys.com 

Virtex-II Pro multigigabit transceiver 
support for high-speed PCB design 
(HSPICE®); FPGA Compiler II™ RTL 
synthesis for Virtex series FPGAs; 
VCS™(MX)/Scirocco™ (MX) simulators; 
Formality® formal verification, LEDA® 
RTL linter, and PrimeTime™ static 
timing analysis; high-level design using 
CoCentric™ System C Compiler and 
CoCentric System Studio 


Synplicity 

www.synplicity.com 

RTL synthesis (Synplify®), debugger 
technology (Identify®), and physical 
synthesis (Amplify®) for all Xilinx FPGAs 


Translogic 

www. translogiccorp.com 

FPGA symbol generation for fast PCB 
design iterations (FPGA Connector™) 
for Virtex-II series FPGAs 


Verplex 
www.verplex.com 
Formal verification technology for Virtex-II 


series FPGAs (Conformal™ LEC) 
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Xilinx AllianceCORE Third-Party 
IP Providers and Products 


Alcatel Internetworking, Inc. 
www.ind.alcatel.com/emac/ 
Ethernet media access controller cores 


Amphion Semiconductor Ltd. 
www.amphion.com 

DSP, wireless communications, and 
multimedia cores 


Barco-Silex 

www. barco-silex.com 

DSP cores, with a specialization in 
image-audio applications 


CAST, Inc. 

www.cast-inc.com 

General-purpose IP for processors, 
peripherals, multimedia, networking, 
encryption, serial communications, 
and bus interfaces 


Crossbow Technologies, Inc. 
www.crossbowip.com 

System Interconnect IP for on-chip and 
chip-to-chip multiprocessing 


Deltatec S.A. 

www.deltatec.be/ 

IP and services for digital imaging, DSP, 
multimedia, PCI, and datacom 


Derivation Systems, Inc. 
www.derivation.com 

IP cores for high assurance hardware/soft- 
ware applications and embedded systems 
products; 32-bit Java processor core 


Digital Communications Technologie 
www.dctl.com 
Embedded Java solutions 


Digital Core Design 

www.dcd.pl/ 

Microprocessor, microcontroller, 

floating point, and serial communication 
controller cores 


Dolphin Integration 

www.dolphin.fr! 

Bus interface, processor, peripheral, and 
DSP cores; EDA software 
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Duma Video, Inc. 

www.dumavideo.com 

Digital video, AES/EBU audio, MPEG 
and DCT cores; real-time MPEG-2 HD 
and SD compression 


elnfochips Pvt. Ltd. 
www.einfochips.com 
PCI, USB, 80186, multimedia, and 


wireless cores 


Eureka Technology 

www.eurekatech.com 

Silicon-proven IP for CPU interface, PCI, 
PCMCIA, memory control, and other 
peripheral functions for SoC designs 


GDA Technologies, Inc. 
www,.gdatech.com 
Hypertransport, PCI, Ethernet, 


design services 


GV & Associates, Inc. 
Wwww.gvassociates.com 

Boards and expertise in DSP, processor, 
software, analog, and RF design 


Helion Technology Limited 
www.heliontech.com 
Encryption and DSP algorithms; 


design services 


iCoding Technology, Inc. 
www.icoding.com 

Turbo Code-related error correction 
products including system design 


Intrinsyc, Inc. 

www.tintrinsyc.com 

Single board embedded computers 

(FPGA + external processor); peripheral IP 


Loarant Corporation 
www.loarant.com 
Proprietary 16-bit RISC CPU; embedded 


system design 


MEET Ltd. 
www.meet-electronics.com 

IP cores dedicated to industrial 
servomotor control 


Memec Core 
www.memecdesign.com/xilinx/ 
Peripheral, bus interfaces, encryption, 
and communications cores 
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ModelWare, Inc. 
www.modelware.com 
Datacom/telecom cores including ATM, 


HDLC, POS, IMA, UTOPIA, and bridges 
NewLogic GmbH 


www.newlogic.com 
Cores for wireless applications including 
Bluetooth and 802.11 


Paxonet Communications 
www.paxonet.com 

Solutions for interworking metro 
networking technologies to SONET 


Perigee, LLC 

www.perigeelle.com 

DSP and image processing cores; embedded 
firmware, GUI, and application software 
development 


Pinpoint Solutions, Inc. 
www.asic-design.com 
Standards-based telecom and datacom IP 


QualCore Logic, Inc. 
www.virtualipgroup.com 

Cores for microcontrollers, telecommunica- 
tions, datacom, PC peripherals, and bus 
interface standards 


RealFast Operating Systems AB 
www.realfast.se! 
Real-time systems and RTOS; operating 


system cores 


Robert Bosch GmbH 
www.can.bosch.com 
Automotive, CAN 


SOC Solutions, L.L.C. 
www.socsolutions.com 

Embedded pP and software; networking, 
wireless, audio, GPS, and handheld; 
peripheral cores 


SysOnChip, Inc. 
www.sysonchip.co.kr/ 

IP and products for COMA 
Cellular/PCS/WLL modem, FEC, 
and Bluetooth 


Telecom Italia Lab S.p.A. 
www.idosoc.com/soc/viplibrary/ 
Cores for ATM, optical networking, 
and switched digital video broadcast 
equipment 
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Tensilica, Inc. 

www.tensilica.com 

Configurable processor cores and 
development tools 


TurboConcept 

www.turboconcept.com 

Turbo Code forward error correction 
solutions 


Wipro, Ltd. 

www.wipro.com 

IP building blocks for technologies like 
IEEE 1394, USB, Bluetooth and IEEE 
802.11 


Xylon d.o.0. 

www.logicbricks.com 

Human-machine interfaces and industrial 
communication controllers 


Zuken, Inc. 

www.zuken.co.jp/soc! 

PCI2.2, 10/100 Ethernet, and Gigabit 
Ethernet cores 


PCI-X, VME, 12C, 
Video, LZS, PS2 
Design Services 
Prototyping 
Xilinx Xperts 


Reach over 70,000 
programmable logic 
professionals worldwide. 
Advertise tir the... 


j 
‘ | 
A 


olulsarel 
_ 


eymrtildeswithiarieiitelal 
Call: (800) 493-5551 
xcelladsales@aol.com 
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Reference Design Partners 


The Xilinx Reference Design Alliance 
Program builds partnerships with industry- 
leading semiconductor manufacturers to 
develop reference designs for accelerating 
our customer's product and system time to 
market. The goal is to build a library of 
high-quality, multicomponent system-level 
reference designs in the areas of networking, 
communications, image and video 
processing, DSP, and emerging technolo- 
gies and markets. 


Agere Systems 
www.agere.com 


AMCC 


Wwww.amcc.com 


Analog Devices 
www.analog.com 


Bay MicroSystems 
www. baymicrosystems.com 


BitBlitz Communications 
www.bitblitz.com 


Broadcom Corp. 
www.broadcom.com 


Centillium 
www.centillium.com 


Congnigine Corporation 
www.cognigine.com 


Cypress Semiconductor 
WWW. CYPTEss. com 


IBM Microelectronics 
www.ibm.com/us/ 


IDT 


www.idt.com 


Ignis Optics 


www.ignis-optics.com 
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Infineon Technologies AG 
www. infineontechnologies.com 


Intel Corporation 
developer. intel. com/design/network/ 


Intrinsity 
www.intrinsity.com/home.htm 


Micron Technology 


www.micron.com 


Motorola 
www.motorola.com 


Octasic Semiconductor 
www.octasic.com 


PMC-Sierra Inc. 


www.pmc-sierra.com/index.html 


SiberCore Technologies 
www.sibercore. com 


Teracross 
www.teracross.com/~web/ 


Texas Instruments 
www.ti.com 


TransSwitch Coporation 
www.transwitch.com/index_flash jsp 


Velio Communications, Inc. 
www.velio.com 


Vitesse 
www.vitesse.com 


West Bay Semiconductor Inc. 
www.westbaysemi.com 


ZettaCom 
www.zettacom.com 


Xanoptix 
www.xanoptix.com 
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XPERTS Partners 


3T BV 

www.3Tnl/ 

Design services; measurement instruments; 
fail-safe controlling; medical equipment, 
communication, and computer peripherals 


Accent S.r.l. 

www.Accent.it/ 

Complex FPGAs; SoCs; PowerPCs; 
IP platforms; software, PCB, and IC 


foundry services, from concept to silicon 


ADI Engineering 

www.adiengineering.com 

Reference designs; custom designs; design 
services focused on Intel IXA and XScale 


Advanced Electronic Designs, Inc. 
www.aedbozeman.com 

Design services and manufacturing support; 
hardware and software design for embedded 


systems 


Advanced Principles Group, Inc. 
www.advancedprinciples.com 

Design services and consulting; software 
algorithm acceleration and application- 
specific computing systems 


Alpha Data 

www.alpha-data.com 

PCI/PMC/CPCI FPGA boards; embedded 
applications using HDL or System 
Generator methodologies 


AMIRIX Systems Inc. 

www.amirix.com 

Design services for electronic product 
realization; high-performance digital boards; 
programmable logic; embedded software 


Andraka Consulting Group, Inc. 
www.andraka.com 

Exclusively FPGAs for DSP since 1994; 
applications include radar, sonar, digital 
communications, imaging, and video 


Array Electronics 

www.array-electronics.de/ 
SoC/FPGA/PCB designs in telecom, 
industrial, and DSPs; concepts; verification; 
prototypes; cores 
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Avnet Design Services 
www.ads.avnet.com 

Embedded processing FPGA design 
services; Virtex-II Pro; high-speed 
SERDES; IP core integration; timing 
closure; RLDRAM; RapidIO™ 


Baranti Group Inc. 
www.baranti.com 
Award-winning design/manufacturing firm; 


expertise in digital video, FPGAs 


Barco Silex 

www. barco-silex.com 

Total solutions consisting of IR FPGA, 
and board design 


Benchmark Electronics Inc. 

www.bench.com 

EMS/JDM team; product design, support, 
introduction, and launch into volume 
production 


Birger Engineering, Inc. 

www. birger.com 

Development of algorithms, electronics, 
and systems for digital imaging 


BitSim AB 

www. bitsim.com 

Design services; DSP, video, 3G, and 
WLAN designs; high-speed prototyping 
boards 


Bottom Line Technologies Inc. 
www.bltinc.com 

FPGA, product, and system design services; 
networking; data communications; 
telecommunications; video; PCI; signal 
processing; military; aerospace 


CES Design Services @ Siemens Program 
and System Engineering 

www, ces-designservices.com 
Platform-oriented SoC/FPGA/PCB and 
firmware design in telecom, industrial, and 
video processing; customized solutions; 
concept; verification; prototypes 


CG-CoreEl Programmable Solutions P. Ltd. 
www.cg-coreel.com 

Design services; telecom, networking, and 
processor IP and designs; turnkey FPGA; 
RTL design/validation 





PAGES 


Chess iT /Embedded 

www.chess.nl/ 

Embedded hardware and software design of 
products and services at the chip, board, 
and system level 


Colorado Electronic Product Design, Inc. 
www.cepdinc.com 

Designing embedded systems with 
FPGAs for DSP; interfaces for medical, 


consumer, aerospace, and military 


Comit Systems, Inc. 

www.comit.com 

High-end FPGA/SoC design services, 
including the integration of customer- 


developed and third-party IP 


ControlNet India Pvt Ltd. 
www.controlnetindia.com 

ASIC/SoC Analog/RF FPGA design 
services; IP development; domains: 
Ethernet, IEEE1394, USB, security, PCI, 
WLAN, Infiniband 


Convergent Design LLC 
www.convergent-design.com 
Audio/video design for professional/ 
consumer products 


DATA Respons ASA 

www. datarespons.com 

Professional design and development 
services for embedded hardware/software 
solutions; FPGA/digital hardware applica- 


tion specialists 


Deltatec International Inc. 
www.deltatec.be/ 

Design services; digital imaging applica- 
tions; broadcast video, image processing, 
and image synthesis 


Digicom Answers Pty Ltd. 
www.fpga.com.au/ 

Design services; experienced FPGA 
resources; design-on-demand; pre-design 
advice; reviews and problem solving 


Digital Design Corporation (DDC) 

www. digidescorp.com 

Design services and products; image pro- 
cessing; video; communications; DSP; 
audio; automotive; controls; interfaces 
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Dillon Engineering, Inc. 
www.dilloneng.com 

Design services; FPGA-based DSP 
algorithms; high-bandwidth, real- 

time digital signal and image processing 
applications 


DRS Tactical Systems West Inc. 

(formally Catalina Research) 
www.catalinaresearch.com 

Design services; FPGAs for DSP, including 
the manufacturer of Virtex reconfigurable 
computing boards 


Eden Networks, Inc. 
www.edennetworks.com 

Turnkey design services; telecom and net- 
working designs (Ethernet, SONET, ATM, 
VoIP) 


Edgewood Technologies, LLC 
www.edgewoodtechnologies.com 

Design services; FPGAs for telecom, DSP, 
embedded processors 


Enea Data AB 

www.enea.se/ 

Telecommunication, data communication, 
DSP, RTOS, automotive, defense, high- 


reliability, and consumer electronics design 


Enterpoint Ltd. 

www.enterpoint.co.uk/ 

Design services; development boards; video, 
telecommunications, military, and high- 


speed design 


ESD Inc. 

www.esdnet.com 

Design services; FPGAs for embedded 
systems, software/hardware design, PCB 
layout 


Evatronix S.A. 

www.evatronix.pl/ 

Design services; FPGA design (Virtex, 
Virtex-E, Spartan-II); embedded systems; 
SoC and SoPC designs 


Evermore Systems Inc. 
www.evermoresystems.com 

Design services; FPGA designs for commu- 
nications, audio/video, wireless, and home 
networking 
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ExaLinx 

www.exalinx.com 

Design services; ASIC emulation for high- 
speed communications and wireless appli- 
cations; reference designs 


Fidus Systems Inc. 

www,fidus.ca/ 

Design services; FPGAs; communications; 
video; DSP; interface conversions; ASIC 
verification; hardware; PCB layout; soft- 


ware; SI/EMC 


First Principles Technology, Inc. 
www.fptek.com 

Design/assembly/test services; FPGA + 
PCB + DSP/processor + system; imaging; 
video; test; satellite telecom; control 


Flexibilis Oy 

www,flexibilis.com 

IP development; FPGA design services; 
Linux device drivers 


Flextronics Design 
www,flextronics.com/design/ 

Complete product design and development 
services for high-speed communications; 
reconfigurable computing; imaging; 
audio/video; military 


GDA Technologies, Inc. 
www.gdatech.com 

A leading services company focused on 
designing systems, SoC, ASIC, FPGA, 
and IP 


HCL Technologies 

www.heltech.com 

High-speed board, ASIC, and FPGA 
design and verification services in avionics, 
medical, networking/telecom, and 
consumer electronics 


Helion Technology Limited 
www.heliontech.com 

Design services and IP for FPGA with a 
focus on data security, wireless, and DSP 


IDERS Inc. 

www.iders.cal 

Contract electronic engineering and EMS 
resource, providing complete product cycles 
ranging from specifications to production 
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Image Technology Laboratory Corporation 
www,.gazogiken.co.jp/ 

Design services; FPGAs for image process- 
ing; design tools: EDK, System Generator, 
and Forge 


INITEC AG 

www.initec.ch/ 

Design services for system design; FPGA 
applications in telecommunication systems 
and medical electronics 


Innovative Computer Technology 
ictconn.com 

Digital, analog, Xilinx FPGA, and software 
design solutions for customers in a wide 
variety of industries throughout the United 
States 


iWave Systems Technologies Private Ltd. 
www. iwavesystems.com 

Embedded hardware and software design; 
board design; FPGA and ASIC develop- 


ment services 


Liewenthal Electronics Ltd. 
www.liewenthal.ce/ 
Embedded systems design services; develop- 


ment of software and hardware including 
Virtex and Spartan FPGAs 


Logic Product Development 

www.logicpd.com 

Full-service product development, including 
industrial design, mechanical, electrical, and 
software engineering 


M&M Consulting 

hellnan@mmcons.com 

Design services; imaging; video and audio 
designs; high-density ASIC emulation 


Memec Design 


www.memecdesign.com/xilinx 


Focused on FPGA design for Xilinx 
technology, we have completed more 
than 1,300 designs worldwide 


Memondo Graphics 

www.memondo.com 

Implementation of digital signal processing, 
communications, and image processing 
solutions and algorithms over Xilinx FPGAs 
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Mikrokrets AS 

www.mikrokrets.no/ 

FPGA development comprising telecom- 
munication, network communications, and 


embedded systems; PCB development 


Millogic Ltd. 

www.millogic.com 

Design services and synthesizable 
cores; expertise in PCI, video, imaging, 
DSP, compression, ASIC verification, 
and communications 


Milstar Inc. 

www.Milstar.co.il/ 

Design services and manufacturers 
(industrial and military specs); DSP; 
communication; video; audio; 


high-speed boards 


Misarc srl - Agrate 

www.misarc.com 

Design services and manufacturers; 
telecommunications, SoC development, 
and test equipment boards using FPGA 


solutions 


Multi Video Designs 

www.mvd-fpga.com 

Design services; DSP for video, telecom- 
muications; SoC with Virtex-II/Pro and 
Spartan (PowerPC/MicroBlaze™); Xilinx 


training 


Multiple Access Communications Ltd. 
www.macltd.com 

Design services; products and consultancy; 
DSP for wireless communications 


Nallatech 

www.nallatech.com 

Embedded reconfigurable computers for 
high-performance applications in DSP, 
imaging, and defense; Xilinx Diamond 


XPERTS 


NetQuest Corporation 
www.netquestcorp.com 

Turnkey engineering design and produc- 
tion services in advanced high-speed 
embedded communications 


North Pole Engineering Inc. 
www.northpoleengineering.com 
Hardware and software design services for 


FPGA, ASIC, and embedded systems 
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Northwest Logic 

www.nwlogic.com 

PCI, PCI-X, and SDRAM controller IP; 
high-end FPGA design services 


NovTech Engineering 

www.novtech.com 

Board-level and IP cores; turnkey design 
services; imaging; communication; PCI 
cores; encryption 


NUVATION 

www.nuvation.com 

Design services; imaging and communica- 
tions for defense/security, medical, con- 
sumer, and datacom/telecom markets 


Plextek Ltd. 

www.plextek.com 

Design services; FPGA, DSP, radio, 
microwave, and microprocessor technolo- 
gies for defense and communications 


POLAR-Design 

www.polar-design.de/ 

High-end FPGA design for telecom, 
networking, and DSP applications 


Polybus Systems Corporation 
www.polybus.com 

Test patterns; customizable in-system 
FPGA test patterns; design services; net- 
working and communications 


Presco, Inc. 

www.prescoinc.com 

Twenty-five years of experience in custom 

electronics design/manufacturing for high- 
performance image processing, data com- 

munications, inkjet 


Productivity Engineering GmbH 
www.pe-gmbh.com 

Design services; PCI design; microproces- 
sor cores; obsolete component replace- 
ments; ASIC prototyping 


Programall Technologies Inc. 
www.ProgramallTechnologies.com 

Design services; embedded systems target- 
ing Virtex, Spartan, Virtex-II, and Virtex-II 
Pro FPGAs 


R&D Consulting 

972-9-7673074 

FPGA development; video; imaging; 
graphics; DSP; communications 
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Rapid Prototypes, Inc. 

www.FPGA.com 

Design services; high-performance reconfig- 
urable FPGA applications requiring maxi- 
mum speed or density using physical design 
principles 


RDLABS 

www.rdlabs.com 

FPGA/ASIC design; wireless communica- 
tions; 802.11b/a IP cores; Matlab, Sysgen, 
ModelSim; instruments HP1661A, 
HP8594E, HP54520A 


RightHand Technologies, Inc. 
www.righthandtech.com 

Design services; Virtex-I] Pro FPGAs, 
boards, and software for video gaming, 
storage, medical, and wireless 


Rising Edge Technology Inc. 
www.risingedgetech.com 

Complete design solutions from the ground 
up; FPGA interface designs 


Riverside Machines Ltd. 
www.riverside-machines.com 

Design services and project management; 
DSP. wireless, network, and processor 
development; Verilog, VHDL, SystemC, 


Specman 


Roke Manor Research 

www.roke.co.uk/ 

Design services; IPR ATM, DSP, imaging, 
radar, and control solutions for FPGAs 


Roman-Jones, Inc. 

www.roman-jones.com 

Design services; FPGAs for embedded 
microprocessors; PCI; DSP; low-cost 8051 
softcore 


Sapphire Computers, Inc. 
drudolf@voyager.net 

Engineering design consulting; high-speed 
digital and FPGA design 


Siemens AG, Electronic Design House 
www.eda-services.com 

Design services; system-on-chip; ASICs; 
FPGAs; IPs; DSPs; boards; EMC; 
embedded software; prototypes 


Fall 2003 





| IL I IN| XS 


Silicon & Software Systems Ltd. ($3) 
www.s3group.com 

World-class electronics design company 
uniquely combining IC, FPGA, software, 
and hardware design expertise 


Silicon Interfaces America Inc. 
www.siliconinterfaces.com 

Design services; develops IP in areas of 
networking, wireless, optical, datacom, 
interconnect, and microcontrollers 


Silicon System Solutions P/L 
www.silicon-systems.com 

Design services; FPGAs for very high-speed 
applications including communications and 
DSP applications 


Siscad S.p.A. 

www.siscad.it/ 

Xilinx FPGA design services; IP 
development and integration 


Smart Logic, Inc. 

www.smart-logic.com 

Design services; FPGAs for rapid 
prototyping including imaging, compres- 
sion, and real-time control 


SoleNet, Inc. 

www.solenet.net 

FPGA, board, and system designs for wire- 
less, telecommunications, consumer, and 
reference design applications 


so-logic electronic consulting 
www,so-logic.net 

Embedded systems (PowerPC, 
MicroBlaze); system-level design 
(Forge, System Generator, Handel-C); 
Xilinx training centers (Austria, 
Eastern Europe) 


Styrex AB 

www.styrex.se! 

Design services for embedded systems; 
programmable logic, hardware develop- 
ment, and microprocessor programming 


Synopsys Professional Services 
www.synopsys.com 

Design services to solve your design 
challenges in system-level design, RTL 
design, and physical design 
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Syntera AB 

www.syntera.se! 

Design services; FPGAs for radar, digital 
communications, telecom automotive, 
aerospace, and DSP 


Tachyon Semiconductor 
www.tachyonsemi.com 

Design services for portable computing, 
networking, and embedded systems 


Tao of Digital, Inc. 

www.taoofdigital.com 

High-speed, large-gate-count SoC designs 
using IP cores for maximum modularity 
and efficiency 


Technolution B.V. 
www.technolution.nl/ 
Innovative hardware and software solutions 


Thales Airborne Systems 
www.thalesgroup.com/airbornesystems 
Provider and integrator with high expertise 
in FPGA design for radars, DSP, and 


communications 
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TietoEnator R&D Services AB 

www. TietoEnator.com 

European design services for telecom, 
industrial IT, and automotive; Xilinx exclu- 
sive training provider in Scandinavia 


V-Integration 

www. V-Integration.com 

Providing first-pass success in complex 
FPGA designs; applications include imag- 


ing, communications, and interface design 


Williams Consulting, Inc. 

chipw@weiatl.com 

Embedded systems hardware and software; 
video; MPEG; control 


Zaiq Technologies, Inc. 

www.zaigtech.com 

Design; partitioning and timing conver- 
gence; VIP; platform experts for embedded 
computer, networking, wireless, and 
instrumentation 


LO) YANTAYLOLUI(OMm AOLUMOLX- MN oM III (elamer-icst-we 


Two Xilinx XC2V8000 FPGAs per Board 


- Six 2Mx18-bit QDR SRAMs per FPGA 

- Over 200 direct off-board digital I/O links 
- ChipScope Pro headers 

- 128Mbytes SDRAM per FPGA/DSP 


Two TigerSHARC DSPs 

- Floating-Point DSP 

- Inter-DSP and FPGA 
data links 

- 4Mbytes FLASH per 
DSP/FPGA 


CompactPCI 
- 32/64-bit PCI 
- Hotswap support 


Software & Tools 

- FFT, radar receiver, FIR, QR, 
Floating-point IP cores 

- Loader & diagnostic utilities 

- VHDL examples 

- Low level data communications 
libraries 

- VxWorks drivers 


Contact us for Information on: 


PMC, PCI, VME and Virtex-Il Pro Solutions 


For More Information Call 


USA 607 257 8678 
Europe/Asia +44 (0)1494 464432 
Israel +972 3 9518718 
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From Octo 


Learn how the |atest Xilinx technology can help you 
design cost effective solutions faster. 


Gain hands-on experience to speed your next development cycle. 


What Are You Waiting For? 
Register now for the event nearest you. 






LOCATIONS 


Albuquerque Cofarado Springs Hunterville Philadelphia St Lauls 
Atlanta Connecticut Irving Phoenix Syracuse 
Austin Dallas Kansas City Portland Tallahassee 
Baltimara Dayton Lot Angela Rochester Totonte 
Boston Denver Winnea pecs. Salt Lake Copy VARcoayver 
Calgary Dobra Moniraal Ban Diage 

Chicago FL Lauderdale Orlando San Jose 

Cleveland Houston Ottawa Saati 


Insight $= XILINX’ 





Call 888.488.4133 (x915) or register at 


wwe.insight.na.memec.com/x-test2003 
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Virtex™ Series Virtex-E: V50E -V300E Virtex: V50 - V600 

Virtex-Il: 2V40 - 2V250 Virtex-E: V50E - V600E 

Virtex-Il Pro: 2VP2 Virtex-Il: 2V40 - 2V500 
Virtex-Il Pro: 2VP2, 2VP4, 2VP7 


Spartan™ Series Spartanll/IIE: ALL (except Spartan-lI/IIE: ALL Spartan-lI/IIE: ALL Spartan-lI/IIE: ALL 


XC2S400E and XC2S600E) Spartan-3: 3550, 35200, 35400 Spartan-3: ALL Spartan-3: ALL 
Spartan-3: 3850, 3$200, 35400 


CoolRunner™ XPLA3 
CoolRunner-Il 


ALL 
XC9500™ Series ALL 
Yes 
Yes 


i 


Educational Services 


8 


Design Services 


es 
Sold as an Option Sold as an Option 


ALL 

ALL 

Yes Ye 
Sold as an Option 


Architecture Wizards Yes Yes Yes Yes 
DCM - Digital Clock Management 
MGT - Multi-Gigabit Transcievers 


Xilinx System Generator for DSP No Sold as an Option Sold as an Option Sold as an Option 


GNU Embedded Tools Yes (Available with optional EDK) Yes (Available with optional EDK) Yes (Available with optional EDK) 
GCC - GNU Compiler 
GDB - GNU Software Debugger 


Support Services 


Schematic Editor 


WindRiver Xilinx Edition 
Development Tools 
Diab C/C++ Compiler 
SingleStep Debugger 
visionPROBE II target connection 


Sold as an Option Sold as an Option Sold as an Option 


Xilinx Synthesis Technology (XST) 


& 


Synplicity Synplify/Pro Integrated Interface Integrated Interface 
‘es 
‘es 


Ye 
Integrated Interface (MS Windows) _ Integration Interface (MS Windows) 
Y 


Synplicity Amplify Physical 
Synthesis Support 


Yes 


Leonardo Spectrum Integrated Interface 


es 

es 

Integrated Interface Integrated Interface 

EDIF Interface EDIF Interface EDIF Interface 

CPLD CPLD CPLD (MS Windows) 

‘ 
es 
es 


Integrated Interface 


Synopsys FPGA Compiler II EDIF Interface 


ABEL CPLD (MS Windows) 


FloorPlanner Yes 


g 


Constraints Editor Yes 


& 


<< < < 


Timing Driven Place & Route Yes Y Yes 
a7 SE 


HSPICE Models* 
ModelSim XE II Starter** ModelSim XE II Starter** 


Ye: 

ModelSim® Xilinx Edition (MXE II) ModelSim XE I! Starter 
Ls 
hs 


MS Windows 
ModelSim XE II Starter** 


Static Timing Analyzer 


ChipScope Pro Sold as an Option 


Sold as an Option Sold as an Option 


Yes 
Yes 
Yes 
Yes 
FPGA Editor with Probe Yes 
Yes 
Yes 
Yes 
Yes 
Yes 


Y 
Y 
Ye 
XPower (Power Analysis) Ye 


Y 
Y 
Ye 
Y 
Ye 
3rd Party Equivalency Checking Support 
Y 


es 
es 
es 
Yes 

Yes 

‘es 

Yes 

SMARTModels for PPC and Rocket I/O Yes 
es 


Y 


C (MS Windows 2000/MS Windows XP) PC Only (MS Windows 2000/MS PC (MS Windows 2000/MS Windows —PC(MS Windows 2000/MS Windows 
Windows XP), Linux XP), Sun Solaris, Linux XP), Sun Solaris, Linux 


For more information on the complete list of Xilinx IP products, visit the Xilinx IP Center at http://www.xilinx.com/ipcenter 
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Spartan-ll 












































1.25 Gb/s GFP Framer Paxonet Communications AllianceCORE V-IIP_|_V-Il Sek | Selle 

2.5 Gb/s GFP Framer Paxonet Communications AllianceCORE V-IIP_ | V-Il 

3G FEC Package Xilinx LogiCORE V-ll_ | V-E V 

8b/10b Decoder Xilinx LogiCORE V-IIP_| V-Il_| V-E Vv S-3_| S-IIE | S-ll 
8b/10b Encoder Xilinx LogiCORE V-IIP_| V-IL_ | V-E V S-3_| S-IIE | S-ll 
8-Bit Multichannel GFP Framer Paxonet Communications AllianceCORE V-IIP_ | V-Il 

8-Bit Transparent GFP Framer Paxonet Communications AllianceCORE V-IIP_|_V-Il 

ADPCM, 1024 Channel Amphion Semiconductor Ltd. AllianceCORE V-ll 

ADPCM, 256 Channel Amphion Semiconductor Ltd. AllianceCORE V-ll V-E 

ADPCM, 512 Channel Amphion Semiconductor Ltd. AllianceCORE V-ll Vv 

ADPCM, 768 Channel Amphion Semiconductor Ltd. AllianceCORE V-ll Vv 

AES Decryption family Amphion Semiconductor Ltd. AllianceCORE V-ll_ | V-E V S-Il 
AES Encryption CAST, Inc. AllianceCORE V-IIP_| V-Il_| V-E S-3_| S-IIE 

AES Encryption Family Amphion Semiconductor Ltd. AllianceCORE V-ll_ | V-E V S-Il 
AES Standard Encryptor/Decryptor Helion Technology Limited AllianceCORE V-IIP_| V-IL_ | V-E S-3__| S-IIE 

AES Tiny Encryptor/Decryptor Helion Technology Limited AllianceCORE V-IIP_| V-IL_ | V-E S-3_| S-IIE 

ATM Cell Processor Paxonet Communications AllianceCORE V-IIP_ | V-Il 

AWGN - Additive White Gaussian Noise Xilinx LogiCORE V-IIP_ | V-Il 

Bit Stream Analyzer and Data Extractor Telecom Italia Lab S.p.A. AllianceCORE V-IIP_|_V-II S-IIE 









































Bluetooth Baseband Processor NewLogic GmbH AllianceCORE V-ll V 

Bluetooth Hardware Baseband Controller Wipro, Ltd. AllianceCORE V-ll Vv 

CDMA2000 Turbo Convolutional Codec Xilinx LogiCORE V-IIP_|_V-I S-3 

Convolutional Encoder Xilinx LogiCORE V-IIP_ | V-IL | V-E V S-3_ | S-IIE | S-ll 
Convolutional Encoder Telecom Italia Lab S.p.A. AllianceCORE V-IIP_ | V-I S-IIE 

CRC10 Generator and Verifier Paxonet Communications AllianceCORE Vv Si 
CRC32 Generator and Verifier Paxonet Communications AllianceCORE V Ss 
DES and DES3 encryption engine Memec Core AllianceCORE V-lL_ | V-E Vv S-IIE | S-ll 

DES Encryption CAST, Inc. AllianceCORE V-ll V-E Vv S-Il 

DES3 Encryption CAST, Inc. AllianceCORE V-IIP_ | V-IL V-E S-3 S-IIE 

Distributed Sample Descrambler Telecom Italia Lab S.p.A. AllianceCORE V S-ll si 
Distributed Sample Scrambler Telecom Italia Lab S.p.A. AllianceCORE Vv S-Il SI 
DVB Satellite Modulator Memec Core AllianceCORE V-ll | V-E V S-ll 

E1 Framer Paxonet Communications AllianceCORE V-IIP_ | V-Il S-IIE 

E1 Mapper Paxonet Communications AllianceCORE V-IIP_|_V-Il S-IIE 

Ethernet MAC, 1 Gigabit Wipro, Ltd. AllianceCORE V-ll 

Ethernet MAC, 1 Gigabit Full Duplex Alcatel Internetworking, Inc. AllianceCORE V-ll 








Ethernet MAC, 1 Gigabit Half/Full duplex with GMIl or Gigabit serial PHY 


Xilinx 


LogiCORE 


V-ll 


S-IIE 


Ethernet 1000BASE-X PCS/PMA 


Xilinx 


LogiCORE 





Ethernet MAC, 10 Gigabit Full Duplex with XGMII or XAUI 


Xilinx 


LogiCORE 


V-ll 





XAUI 


Xilinx 


LogiCORE 


<|<|</< 
vl u)/uvIU 
































Ethernet MAC, 10/100 Alcatel Internetworking, Inc. AllianceCORE V-ll S-ll 

Ethernet MAC, 10/100 CAST, Inc. AllianceCORE V-IIP_ | V-IlL | V-E S-IIE | S-ll 

Ethernet MAC, 10/100 Paxonet Communications AllianceCORE Vv S-Il S 

Ethernet MAC, 10/100 Zuken, Inc. AllianceCORE V-ll V-E 

Ethernet MAC, 10G Paxonet Communications AllianceCORE V-ll 

Ethernet PCS, 10G Paxonet Communications AllianceCORE V-IIP | V-ll 

Ethernet PCS, 10G Memec Core AllianceCORE V-ll 

G.709 Compliant FEC Core Paxonet Communications AllianceCORE V-IIP_ | V-Il 

HDLC, Single-Channel Memec Core AllianceCORE V-IIP | V-ll $-3 

HDLC, Single-Channel Memec Core Submitted V-IIP | V-ll 5-3 

HDLC-PPP for Packet over SONET Paxonet Communications AllianceCORE Vv S-ll 

HyperTransport Cave GDA Technologies, Inc. AllianceCORE V-IIP | V-ll 

HyperTransport Cave GDA Technologies, Inc. Submitted V-IIP_ | V-Il 

nterleaver/De-interleaver Xilinx LogiCORE V-IIP_ | V-IL | V-E V S-3_ | S-IIE | S-ll 
V-IIP 


interleaver Deinterleaver 
nterleaver/Deinterleaver 


Telecom Italia Lab S.p.A. 
Telecom Italia Lab S.p.A. 


AllianceCORE 
AllianceCORE 








V-ll 


S-IIE 


S-ll 





internet Appliance 


SOC Solutions, L.L.C. 


> 


lianceCORE 


V-ll 


V-E 





inverse Multiplexer for ATM 


ModelWare, Inc. 


AllianceCORE 


V-ll 


WAE 






































SPI-3 (POS-PHY L3) Link Layer Interface, 1-Ch 


Xilinx 


LogiCORE 


V-IIP 


V-ll 


V-E 


S-IIE 


S-ll 


SPI-3 (POS-PHY L3) Link Layer Interface, 2-Ch 


Xilinx 


LogiCORE 


V-IIP 


V-ll 


V-E 


S-IIE 


S-ll 





TU-T J.83 Annex B Modulator Xilinx LogiCORE V-IIP_ | V-Il S-3 
MD5 Message Digest Algorithm CAST, Inc. AllianceCORE V-IIP | V-ll $-3 S-IIE 
Noisy Transmission Channel Model Telecom Italia Lab S.p.A. AllianceCORE V-IIP_|_V-I S-IIE 
0C12 Framer Paxonet Communications AllianceCORE V-IIP_ | V-Il 
OC12c Path Processor Paxonet Communications AllianceCORE V-IIP_ | V-Il 
OTU2 Framer Paxonet Communications AllianceCORE V-IIP_|_V-Il 
Reed Solomon Decoder Xilinx LogiCORE V-IIP_ | V-IL_ | V-E V S-3_| S-IIE | S-ll Ss 
Reed Solomon Decoder Memec Core AllianceCORE V-ll Vv S-IIE | S-ll 
Reed Solomon Encoder Xilinx LogiCORE V-IIP_| V-Il_|_V-E Vv S-3_| S-IIE | S-ll S 
Reed Solomon Encoder Memec Core AllianceCORE V-ll V S-IIE | S-ll 
Reed-Solomon Decoder Telecom Italia Lab S.p.A. AllianceCORE V-ll Vv 
SDLC Controller CAST, Inc. AllianceCORE V-ll Vv 
SHA-1 Encryption processor CAST, Inc. AllianceCORE V-lL_|_V-E Vv S-IE | S-Il 
SPI-3 (POS-PHY L3) Link Layer Interface, 1-256Ch Xilinx LogiCORE V-IIP_ | V-Il_ | V-E V S-3_ | S-IIE | S-ll 
Vv 
V 
Vv 


SPI-3 (POS-PHY L3) Link Layer Interface, 4-Ch 


Xilinx 


LogiCORE 


V-IIP 


V-ll 


V-E 


S-IIE 


S-ll 





SPI-3 (POS-PHY L3) Physical Layer Interface 


Xilinx 


LogiCORE 


V-E 








SPI-4.1 (Flexbus 4) Interface Core, 1-Channel Xilinx LogiCORE V-ll 
SPI-4.1 (Flexbus 4) Interface Core, 4-Channel Xilinx LogiCORE V-ll 
SPI-4.2 (POS-PHY L4) Multi-Channel Interface Xilinx LogiCORE V-IIP. | V-Il 









































SPI-4.2 (POS-PHY L4) to SPI-4.1 (Flexbus 4) Bridge Xilinx LogiCORE V-ll 
SPI-4.2 (POS-PHY L4) to XGMIl (10GE MAC) Bridge Xilinx LogiCORE Vell 
SPI-4.2 Lite (POS_PHY L4) Xilinx LogiCORE Vell 53 
Visit the Xilinx IP Center for more details at www.xilinx.com/ipcenter 
Xcell Journal 93 


Wu 
fa) 
=) 
0 
mm) 
U 
= 
Wu 
ce 
Lu 
LL. 
TT) 
c 
a. 
a 
ra 
_ 
a 


2 S<XILINX’ 


The Programmable Logic Company™ 










































































































































































































































































wi 

Ge ee || or 

i= i= i 

£ £ |] £ 

Function Vendor Name = = 

a FF & 
STS192/STM64 Framer Paxonet Communications AllianceCORE V-IIP | V-Il 
STS192/STM64 Path Processor Paxonet Communications AllianceCORE V-lIIP | V-Il 
STS48 OTN Framer/Digital Wrapper Paxonet Communications AllianceCORE V-IIP_ | V-Il 
T1 Framer Paxonet Communications AllianceCORE V-IIP_ | V-Il S-IIE 
T1 Framer QualCore Logic, Inc. AllianceCORE 
Turbo Decoder, Convolutional, 3GPP Compliant Xilinx LogiCORE V-ll V-E Vv 
Turbo Decoder, Product Code Xilinx LogiCORE V-IIP | V-II 
Turbo Decoder Telecom Italia Lab S.p.A. AllianceCORE V-ll V 
Turbo Decoder, 3GPP. iCoding Technology, Inc. AllianceCORE V-ll V-E S-IIE 
Turbo Decoder, 3GPP. SysOnChip, Inc. AllianceCORE V-ll V-E 
Turbo Decoder, DVB-RCS iCoding Technology, Inc. AllianceCORE V-ll V-E Vv 
Turbo Decoder, DVB-RCS TurboConcept AllianceCORE V-ll V-E 
Turbo Encoder, Convolutional, 3GPP Compliant Xilinx LogiCORE V-ll V-E Vv 
Turbo Encoder, Product Code Xilinx LogiCORE V-IIP | V-Il 
Turbo Encoder Telecom Italia Lab S.p.A. AllianceCORE V-ll V S-ll 
Turbo Encoder, DVB-RCS iCoding Technology, Inc. AllianceCORE V-ll V-E Vv 
Turbo Product Code Decoder, 25 Mbps TurboConcept AllianceCORE V-ll | V-E 
UMTS ADDRESS Gen Telecom Italia Lab S.p.A. AllianceCORE V-IIP_ | V-Il_ | V-E S-HIE 
UMTS Turbo Decoder Telecom Italia Lab S.p.A. AllianceCORE V-IIP_ | V-Il_ | V-E 
Utopia 2 Master Telecom Italia Lab S.p.A. AllianceCORE V-IIP_ | V-Il S-IIE 
Utopia 2 Master Telecom Italia Lab S.p.A. AllianceCORE V-IIP | V-Il S-IIE 
UTOPIA Level-2 PHY Side RX Interface Telecom Italia Lab S.p.A. AllianceCORE V S-ll SI 
UTOPIA Level-2 PHY Side TX Interface Telecom Italia Lab S.p.A. AllianceCORE V S-ll Si 
UTOPIA Slave (CC1435S) Paxonet Communications AllianceCORE Vv S-ll S 
Viterbi Decoder, General Purpose Xilinx LogiCORE V-IIP | V-II V-E Vv $-3 S-IIE | S-ll 
Viterbi Decoder, IEEE 802-compatible Xilinx LogiCORE V-IIP | V-II V-E Vv S-IIE | S-Il 
Viterbi Decoder Telecom Italia Lab S.p.A. AllianceCORE V-IIP_ | V-Il_ | V-E S-IIE | S-ll 
XAPP 289: Common Switch Interface CSIX-L1 Reference Design Xilinx Reference Design | V-IIP | V-II 
Digital Signal Processing 
Bit Correlator Xilinx LogiCORE V-IIP_ | V-Il_ | V-E V S-IIE | S-ll 
€320C25 Digital Signal Processing CAST, Inc. AllianceCORE V-ll V-E Vv S-ll 
Cascaded Integrator Comb (CIC) Filter Xilinx LogiCORE V-IIP. | V-ll | V-E Vv S-IIE | S-Il 
Comb Filter Xilinx LogiCORE Ss 
CORDIC Xilinx LogiCORE V-IIP_ | V-Il_| V-E V S-3 | S-IIE| S-ll 
Digital Down Converter (DDC) Xilinx LogiCORE V-IIP. | V-Il | V-E Vv S-IIE | S-Il 
Direct Digital Synthesizer (DDS) Xilinx LogiCORE V-IIP | V-ll |) V-E Vv $3 | SHIE| S-ll 
Fast Fourier Transform Xilinx LogiCORE V-IIP | V-Il S33 
FFT/IFFT for Virtex-Il, 1024-Point Complex Xilinx LogiCORE V-ll 
FFT/IFFT for Virtex-lIl, 16-Point Complex Xilinx LogiCORE V-ll 
FFT/IFFT for Virtex-Il, 256-Point Complex Xilinx LogiCORE V-ll 
FFT/IFFT for Virtex-Il, 64-Point Complex Xilinx LogiCORE V-ll 
FFT/IFFT, 1024-Point Complex Xilinx LogiCORE V-E Vv 
FFT/IFFT, 16-Point Complex Xilinx LogiCORE V-ll V-E Vv 
FFT/IFFT, 256-Point Complex Xilinx LogiCORE V-ll V-E Vv 
FFT/IFFT, 32-Point Complex Xilinx LogiCORE V-IIP_ | V-Il_ | V-E V S-IIE | _S-ll 
FFT/IFFT, 64-, 256-, 1024-Point Complex Xilinx LogiCORE V-IIP_ | V-Il S-3 
FFT/IFFT, 64-Point Complex Xilinx LogiCORE V-E Vv 
FIR Filter using DPRAM elnfochips Pvt. Ltd. AllianceCORE V-ll | V-E V S-IIE | S-ll 
FIR Filter, Distributed Arithmetic (DA) Xilinx LogiCORE V-IIP_ | V-Il_ | V-E V S-3 | S-IIE| S-ll 
FIR Filter, Distributed Arithmetic Parallel Xilinx LogiCORE 5 
FIR Filter, Distributed Arithmetic Serial Xilinx LogiCORE S 
FIR Filter, MAC Xilinx LogiCORE V-IIP_ | V-Il_ | V-E V S-3 | S-IIE| S-ll 
LFSR, Linear Feedback Shift Register Xilinx LogiCORE V-IIP | V-lIl V-E Vv $-3 S-IIE | S-Il 
Oscillator, Dual-Channel Numerically Controlled Xilinx LogiCORE V-E Vv S-IIE | S-Il G 
Oscillator, Numerically Controlled Xilinx LogiCORE V-E Vv S-IIE | S-ll S 
Parallel Distributed Arithmetic FIR Filter elnfochips Pvt. Ltd. AllianceCORE V-ll | V-E V S-IIE | _S-ll 
Time-Skew Buffer, Nonsymmetric 16-Deep Xilinx LogiCORE Ss 
Time-Skew Buffer, Nonsymmetric 32-Deep Xilinx LogiCORE Ss 
Time-Skew Buffer, Symmetric 16-Deep Xilinx LogiCORE S 
Virtex-E DSP Hardware Accelerator GV & Associates, Inc. AllianceCORE 
Virtex-ll DSP Hardware Accelerator GV & Associates, Inc. AllianceCORE V-ll 
Virtex-ll DSP Hardware Accelerator GV & Associates, Inc. AllianceCORE V-ll 
Math Functions 
1s and 2s Complement Xilinx LogiCORE S 
Accumulator Xilinx LogiCORE V-IIP | V-ll_ | V-E Vv S73 |NeS-IIEN|eSall 
Adder Subtracter Xilinx LogiCORE V-IIP_ | V-Il_ | V-E V S-3 | S-IIE| S-ll 
Divider, Pipelined Xilinx LogiCORE V-ll_ | V-E V S-IIE | S-ll |S 
Floating Point Adder Digital Core Design AllianceCORE V-ll V S-ll 
Floating Point Divider Digital Core Design AllianceCORE V-ll V S-ll 
Floating Point Multiplier Digital Core Design AllianceCORE V-ll | V-E V S-ll 
Floating Point Square Comparator Digital Core Design AllianceCORE V-ll V S-ll 
Floating Point Square Root Operator Digital Core Design AllianceCORE V-ll | V-E S-ll 
Floating Point to Integer Converter Digital Core Design AllianceCORE V-ll V S-ll 
Integer to Floating Point Converter Digital Core Design AllianceCORE V-ll V S-ll 
Integrator Xilinx LogiCORE s 
Multiplier for Virtex, Variable Parallel Xilinx LogiCORE V-E Vv S-IIE | S-Il 
Multiplier, Constant Coefficient Xilinx LogiCORE Ss 
Multiplier, Constant Coefficient - Pipelined Xilinx LogiCORE 5 
Multiplier, Dynamic Constant Coefficient Xilinx LogiCORE V-E V 
Multiplier, Parallel - Area Optimized Xilinx LogiCORE Si 
Multiply Accumulator (MAC) Xilinx LogiCORE V-IIP | V-Il V S-3 | S-IIE| S-ll 
Multiply Generator Xilinx LogiCORE V-IIP_ | V-Il_ | V-E V S-3 | S-IIE| S-ll 
Registered Adder Xilinx LogiCORE Si 





Visit the Xilinx IP Center for more details at www.xilinx.com/ipcenter 
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a FF & 
Registered Loadable Adder Xilinx LogiCORE SI 
Registered Loadable Subtracter Xilinx LogiCORE Si 
Registered Scaled Adder Xilinx LogiCORE s 
Registered Serial Adder Xilinx LogiCORE Ss 
Registered Subtracter Xilinx LogiCORE Si 
Scaled-by-One-Half Accumulator Xilinx LogiCORE s 
Sine Cosine Look Up Table Xilinx LogiCORE V-IIP_ | V-Il_| V-E V S-3 | S-IIE| S-ll Ss 
Square Root Xilinx LogiCORE Ss 
Twos Complementer Xilinx LogiCORE V-IIP | V-ll | V-E Vv S230) | ES NE} Soll 
Memories & Storage Elements 
ATM Utopia Level 2 Xilinx LogiCORE V-IIP 
Block Memory, Dual-Port Xilinx LogiCORE V-IIP | V-Il V-E Vv $-3 S-IIE | S-Il 
Block Memory, Single-Port Xilinx LogiCORE V-IIP | V-lIl V-E Vv $-3 S-IIE | S-Il 
CAM for Internet Protocol Telecom Italia Lab S.p.A. AllianceCORE V-IIP_ | V-Il S-IIE 
Content Addressable Memory (CAM) Xilinx LogiCORE V-IIP | V-ll | V-E Vv S-3 | SHIE| S-ll 
Distributed Memory Xilinx LogiCORE V-IIP | V-ll | V-E Vv S330 | 2S EN Sal 
FIFO, Asynchronous Xilinx LogiCORE V-IIP | V-IL V-E Vv $-3 S-IIE | S-ll 
FIFO, Synchronous Xilinx LogiCORE Si 
FIFO, Synchronous Xilinx LogiCORE V-IIP | V-IL V-E Vv $-3 S-IIE | S-ll 
Pipelined Delay Element Xilinx LogiCORE Si 
Registered ROM Xilinx LogiCORE 5 
Registered Single Port RAM Xilinx LogiCORE Ss 
SDRAM Controller Eureka Technology AllianceCORE V 
SDRAM Controller, DDR Memec Core AllianceCORE V-ll V-E S-ll 
SDRAM Controller, Pipelined Eureka Technology AllianceCORE V-IIP | V-Il $-3 S-IIE 
Microprocessors, Controllers & Peripherals 
1 Gigabit Ethernet MAC w/PLB interface Xilinx LogiCORE V-IIP 
10/100 Ethernet MAC Lite w/OPB interface Xilinx LogiCORE V-IIP | V-ll_ | V-E Vv S-IIE | S-Il 
10/100 Ethernet MAC w/OPB interface Xilinx LogiCORE V-IIP | V-ll | V-E Vv SIE || Sell 
16450 UART CAST, Inc. AllianceCORE V-ll | V-E Vv S-IIE | S-Il 
16450 UART QualCore Logic, Inc. AllianceCORE S 
16450 UART w/OPB interface Xilinx LogiCORE V-IIP | V-ll_ | V-E Vv S33) | S-IE) Sill 
16450 UART w/Synchronous Interface CAST, Inc. AllianceCORE V-ll V-E Vv S-IIE | S-ll 
16550 UART w/FIFOs CAST, Inc. AllianceCORE V-ll | V-E Vv S-IIE | _S-Il 
16550 UART w/FIFOs QualCore Logic, Inc. AllianceCORE S 
16550 UART w/FIFOs & synch interface CAST, Inc. AllianceCORE V-ll V-E Vv S-IIE | S-Il 
16550 UART w/OPB interface Xilinx LogiCORE V-IIP | V-ll | V-E Vv S30 | CSE) Sal 
16-bit proprietary RISC Processor Loarant Corporation AllianceCORE V-IIP | V-Il S-3 | S-IIE 
2901 Microprocessor Slice CAST, Inc. AllianceCORE V-E Vv S-ll 
2910A Microprogram Controller CAST, Inc. AllianceCORE Vv S-ll S 
2D Fabric Evaluation and Demo Board Crossbow Technologies, Inc. AllianceCORE S-ll 
2-dimensional multiprocessing interface fabric Crossbow Technologies, Inc. AllianceCORE V-IIP | V-Il S-IIE | S-ll 
68000 compatible microprocessor CAST, Inc. AllianceCORE V-ll | V-E V S-3 
80186 compatible processor elnfochips Pvt. Ltd. AllianceCORE V-ll V 
8051 compatible microcontroller CAST, Inc. AllianceCORE V-IIP | V-lIl V-E Vv $-3 S-IIE | S-Il 
8051 compatible microcontroller Digital Core Design AllianceCORE V-ll Vv S-ll 
80515 High-speed 8-bit RISC Microcontroller CAST, Inc. AllianceCORE Vv 
8052 compatible microcontroller Digital Core Design AllianceCORE V-ll V S-ll 
80C51 compatible RISC microcontroller CAST, Inc. AllianceCORE V-E Vv S-ll 
8237 Programmable DMA Controller CAST, Inc. AllianceCORE V-ll V-E Vv S-ll 
8250 UART CAST, Inc. AllianceCORE VIVE SEN ESSzll 
8254 programmable interval timer/counter core CAST, Inc. AllianceCORE V-ll V-E Vv S-ll 
8254 Programmable Timer QualCore Logic, Inc. AllianceCORE Vv S-ll S 
8254 programmable timer/counter elnfochips Pvt. Ltd. AllianceCORE V-ll S-ll 
8255 programmable I/O controller elnfochips Pvt. Ltd. AllianceCORE V-ll S-ll 
8255 Programmable Peripheral Interface QualCore Logic, Inc. AllianceCORE Vv S-ll S 
8255A Peripheral Interface CAST, Inc. AllianceCORE Vv S-ll 
8259 Programmable Interrupt Controller QualCore Logic, Inc. AllianceCORE S 
8259A Programmable Interrupt controller CAST, Inc. AllianceCORE V-ll V-E Vv S-ll 
Arbiter and Bus Structure w/OPB interface Xilinx LogiCORE V-IIP_ | V-Il_ | V-E V S-3 | S-IIE| S-ll 
Arbiter and Bus Structure w/PLB interface Xilinx LogiCORE V-IIP 
ATM Utopia Level 2 Master and Slave w/OPB Interface Xilinx LogiCORE V-IIP_ | V-Il_ | V-E V S-IIE | S-ll 
ATM Utopia Level 2 Master and Slave w/PLB Interface Xilinx LogiCORE V-IIP 
BRAM Controller w/LMB interface Xilinx LogiCORE V-IIP_ | V-Il_| V-E V S-IIE | S-ll 
BRAM Controller w/OPB interface Xilinx LogiCORE V-IIP_ | V-Il_ | V-E V S-3 | S-IIE| S-ll 
BRAM Controller w/PLB interface Xilinx LogiCORE V-IIP 
BSP Generator (SW only) Xilinx LogiCORE V-IIP 
CRT Controller CAST, Inc. AllianceCORE V-ll V-E S-IIE | S-Il 
DCR Bus Structure Xilinx LogiCORE V-IIP 
DR8051 RISC MicroController Digital Core Design AllianceCORE V-ll V S-ll 
External Memory Controller (EMC) w/OPB interface 
(Includes support for Flash, SRAM, ZBT, System ACE) Xilinx LogiCORE V-IIP | V-Il V-E Vv $-3 S-IIE | S-Il 
External Memory Controller (EMC) w/PLB interface 
(Includes support for Flash, SRAM, ZBT, System ACE) Xilinx LogiCORE V-IIP 
Generic compact UART Memec Core AllianceCORE V-IIP | V-Il S-3 
Generic compact UART Memec Core Submitted V-IIP_ | V-Il S-3 
GPIO w/OPB interface Xilinx LogiCORE V-IIP | V-ll_ | V-E Vv S73 | MeS-IIEN Mesa 
HDLC Controller (Single Channel) w/OPB interface Xilinx LogiCORE V-IIP | V-ll | V-E Vv S-IIE | S-Il 
12C w/OPB interface Xilinx LogiCORE VoIP) | Vell || VEE Vv S30 |S IE) Sal 
Interrupt Controller (IntC) w/DCR interface Xilinx LogiCORE V-IIP. | V-Il | V-E Vv S-IIE | S-Il 
Interrupt Controller (IntC) w/OPB interface Xilinx LogiCORE V-IIP | V-ll | V-E Vv $-3 | SHIE| S-ll 
IPIF Address Decode w/OPB interface Xilinx LogiCORE V-IIP_ | V-Il_| V-E V S-IIE | _S-ll 
IPIF DMA w/OPB interface Xilinx LogiCORE V-IIP | V-I | V-E Vv S-IE | S-Il 

Visit the Xilinx IP Center for more details at www.xilinx.com/ipcenter 
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IPIF Interrupt Controller w/OPB interface Xilinx LogiCORE V-IIP_ | V-Il_| V-E V S-IIE | _S-ll 
IPIF Master/Slave Attachment w/OPB interface Xilinx LogiCORE V-IIP | V-ll | V-E Vv S-IIE | _S-Il 
IPIF Read/Write Packet FIFO w/OPB interface Xilinx LogiCORE VNPE|POVINSEVEE Vv SHEN |EeSsIl 
IPIF Scatter/Gather w/OPB interface Xilinx LogiCORE V-IIP | V-ll | V-E Vv S-IIE | _S-Il 
IPIF Slave Attachment w/PLB interface Xilinx LogiCORE V-lIP 
Java Processor, 32-bit Digital Communications Technologies, Ltd. AllianceCORE V-ll Vv S-ll 
Java Processor, Configurable Derivation Systems, Inc. AllianceCORE V-ll Vv 
JTAG UART w/OPB interface Xilinx LogiCORE V-IIP_ | V-Il_ | V-E V S-3 | S-IIE| S-ll 
Memory Test Utility (SW only) Xilinx LogiCORE V-IIP 
MicroBlaze Soft RISC Processor Xilinx LogiCORE V-IIP | V-ll | V-E Vv Se) || SHE | Sal 
MicroBlaze Source Code Xilinx LogiCORE V-IIP | V-ll_ | V-E Vv S-3 | S-IIE | S-ll 
MIPS system controller Eureka Technology AllianceCORE V-ll_ | V-E 
ML300 VxWorks BSP (SW only) Xilinx LogiCORE V-IIP 
Motor controller - 3 phase MEET Ltd. AllianceCORE S-IIE | _S-ll Si 
OPB2DCR Bridge Xilinx LogiCORE V-IIP 
OPB20PB Bridge (Lite) Xilinx LogiCORE V-IIP_ | V-Il_ | V-E V S-3 | S-IIE| S-ll 
OPB2PCI Full Bridge (32/33) Xilinx LogiCORE V-IIP_ | V-Il_| V-E V S-3 | S-IIE| S-ll 
OPB2PLB Bridge Xilinx LogiCORE V-IIP 
Operating System Accelerator RealFast Operating Systems AB AllianceCORE V-ll S-IIE | _S-ll 
PIC125x fast RISC microcontroller Digital Core Design AllianceCORE V-ll V S-ll 
PIC1655x fast RISC microcontroller Digital Core Design AllianceCORE V-ll V S-ll 
PIC165X compatible microcontroller CAST, Inc. AllianceCORE V-ll V-E Vv S-ll 
PIC165x fast RISC microcontroller Digital Core Design AllianceCORE V-ll | V-E V S-ll 
PIC16C55X compatible RISC processor CAST, Inc. AllianceCORE V-ll | V-E V S-IIE | S-ll 
PicoBlaze (XAPP 213: PicoBlaze 8-bit Microcontroller 
for Virtex-E and Spartan-II/E Devices) Xilinx Reference Design V-E V S-IIE | S-ll 
PicoBlaze (XAPP 627: PicoBlaze 8-bit Microcontroller 
for Virtex-Il and Virtex-Il Pro Devices) Xilinx Reference Design | V-IIP | V-Il 
PLB2OPB Bridge Xilinx LogiCORE V-IIP 
PowerPC Bus Master Eureka Technology AllianceCORE V-IIP_ | V-Il S-3 | S-IIE 
PowerPC Bus Slave Eureka Technology AllianceCORE V-IIP_ | V-Il S-3 | S-IIE 
SDRAM Controller w/OPB interface Xilinx LogiCORE V-IIP | V-ll | V-E Vv S:37)| SIE} Sal 
SDRAM Controller w/PLB interface Xilinx LogiCORE V-IIP 
SPI Master and Slave w/OPB interface Xilinx LogiCORE V-IIP_ | V-Il_ | V-E V S-3 | S-IIE| S-ll 
Systems Reset Module Xilinx LogiCORE V-IIP 
Timebase/Watch Dog Timer (WDT) w/OPB interface Xilinx LogiCORE V-IIP | V-ll | V-E Vv $-3 | SHIE| S-ll 
Timer/Counter w/OPB interface Xilinx LogiCORE V-IIP | V-ll | V-E Vv S73 |eSs IIE) Sal 
UART Lite w/OPB interface Xilinx LogiCORE V-IIP_ | V-IL_ | V-E V S-3 | S-IIE| S-ll 
VxWorks Board Support Package (BSP) Xilinx LogiCORE V-IIP 
Xtensa-V Configurable 32-bit Processor Tensilica, Inc. AllianceCORE V-ll 
280 compatible programmable counter/timer core CAST, Inc. AllianceCORE V-ll_ | V-E V S-ll 
280 peripheral I/O controller CAST, Inc. AllianceCORE V-ll_ | V-E V S-IIE | _S-ll 
Z80CPU Microprocessor CAST, Inc. AllianceCORE V-ll V-E Vv $-3 S-Il 
Standard Bus Interfaces 
Arbiter Telecom Italia Lab S.p.A. AllianceCORE V-IIP | V-Il S-IIE 
CAN 2.0 B Compatible Network Controller Xylon d.o.0. AllianceCORE V-IIP | V-Il S-3 | S-IIE 
CAN Bus Controller 2.0B CAST, Inc. AllianceCORE V-IIP | V-Il $-3 S-IIE 
CAN with 32 mail boxes Robert Bosch GmbH AllianceCORE V-IIP_ | V-Il S-3_ | S-IIE 
Controller Area Network Memec Core AllianceCORE V-IIP_ | V-Il_| V-E V S-3 | S-IIE| S-ll 
HyperTransport Single-Ended Slave Core Xilinx LogiCORE V-IIP_ | V-Il 
12C Bus Controller CAST, Inc. AllianceCORE V-ll V-E Vv $-3 S-ll 
12C Bus Controller Master Digital Core Design AllianceCORE V-ll | V-E S-ll SI 
12C Bus Controller Slave Digital Core Design AllianceCORE V-ll | V-E S-ll § 
12C Bus Controller Slave Base Digital Core Design AllianceCORE V-ll_ | V-E S-ll Si 
12C Two-Wire Serial Interface Master-Only Memec Core AllianceCORE V-ll | V-E V S-IIE | S-ll 
12C Two-Wire Serial Interface Master-Slave Memec Core AllianceCORE V-ll | V-E V S-IIE | _S-ll 
PCI 64-bit/66-MHz master/target interface Eureka Technology AllianceCORE V-ll_ | V-E 
PCI Express x1 Core Xilinx LogiCORE V-IIP 
PCI host bridge Eureka Technology AllianceCORE V-ll | V-E 
PCI, 64-bit Target Interface CAST, Inc. AllianceCORE V-ll V-E S-IIE 
PCI32 Interface Design Kit (DO-DI-PCI32-DKT) Xilinx LogiCORE V-IIP_ | V-IL_ | V-E V S-IIE | S-ll 
PCI32 Interface, IP Only (DO-DI-PCI32-IP) Xilinx LogiCORE V-IIP_ | V-Il_ | V-E V S-3 | S-IIE| S-ll 
PCI32 Single-Use License for Spartan (DO-DI-PCI32-SP) Xilinx LogiCORE S-3 | S-IIE| S-ll 
PCI64 & PCI32, IP Only (DO-DI-PCI-AL) Xilinx LogiCORE V-IIP_ | V-Il_ | V-E V S-3 | S-IIE| S-ll 
PCI64 Interface Design Kit (DO-DI-PCI64-DKT) Xilinx LogiCORE V-IIP_ | V-Il_| V-E V S-IIE | S-ll 
PCI64 Interface, IP Only (DO-DI-PCI64-IP) Xilinx LogiCORE V-IIP_ | V-Il_ | V-E V S-IIE | __S-ll 
PCI-X 64/133 Interface for Virtex-II (DO-DI-PCIX64-VE). 
Includes PCI 64 bit interface at 33 MHz Xilinx LogiCORE V-IIP | V-Il 
PCI-X 64/66 Interface for Virtex-E (DO-DI-PCIX64-VE). 
Includes PCI 64 bit interface at 33 MHz Xilinx LogiCORE VE 
RapidlO 8-bit port LP-LVDS Phy Layer (DO-DI-RIO8-PHY) Xilinx LogiCORE V-IIP | V-Il 
RapidlO Logical (I/O) and Transport Layer (DO-DI-RI08-LOG) Xilinx LogiCORE V-IIP | V-lIl 
RapidlO Phy Layer to PLB Bridge reference design Xilinx Reference Design | V-IIP 
Serial Protocol Interface (SPI) Slave CAST, Inc. AllianceCORE Vl | V-E Vv S-IIE 
USB 1.1 Function Controller CAST, Inc. AllianceCORE V-ll V-E Vv S-ll 
XAPP653: Virtex-II Pro/Spartan-3 3.3V PCI Reference Design Xilinx Reference Design | V-IIP $-3 
Video & Image Processing 
BURST_PLL Pinpoint Solutions, Inc. AllianceCORE V-ll V-E $-3 S-IIE 
Color Space Converter, RGB2YCrCb CAST, Inc. AllianceCORE V-ll V-E Vv S-IIE | S-ll 
Color Space Converter, RGB2YCrCb Perigee, LLC AllianceCORE Vv S-ll S 
Color Space Converter, YCrCb2RGB Perigee, LLC AllianceCORE Vv S-ll S 
Compact Video Controller Xylon d.o.0. AllianceCORE V-ll V S-ll 
Discrete Cosine Transform elnfochips Pvt. Ltd. AllianceCORE V-ll | V-E V S-IIE | S-ll 


Visit the Xilinx IP Center for more details at www.xilinx.com/ipcenter 
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Discrete Cosine Transform, 2D Inverse CAST, Inc. AllianceCORE V-ll V-E Vv S-ll 
Discrete Cosine Transform, Combined 2D Forward/Inverse CAST, Inc. AllianceCORE V-ll V-E Vv S-ll 
Discrete Cosine Transform, Forward 2D CAST, Inc. AllianceCORE V-ll V-E Vv S-ll 
Discrete Cosine Transform, Forward/Inverse Telecom Italia Lab S.p.A. AllianceCORE Vv S-ll 
Discrete Cosine Transform, forward/inverse 2D Barco-Silex AllianceCORE V-ll V-E Vv S-ll 
Discrete Wavelet Transform, Combined 2D Forward/Inverse CAST, Inc. AllianceCORE V-ll V-E Vv S-ll 
Discrete Wavelet Transform, Line-based programmable forward CAST, Inc. AllianceCORE V-ll V-E Vv S-ll 
Huffman Decoder CAST, Inc. AllianceCORE V-ll V-E Vv S-IIE | S-Il 
PEG Fast Codec CAST, Inc. AllianceCORE V-IIP | V-I V-E Vv 5-3 
PEG Fast Codec CAST, Inc. Submitted V-IIP | V-lIl V-E Vv $-3 
PEG Fast Decoder CAST, Inc. AllianceCORE V-IIP | V-II V-E Vv S-IIE 
PEG Fast Encoder CAST, Inc. AllianceCORE V-IIP | V-Il V-E Vv 
PEG, 2000 Encoder CAST, Inc. AllianceCORE V-IIP | V-Il V-E 
PEG, Fast color image decoder Barco-Silex AllianceCORE V-ll V-E Vv 
PEG, Fast gray scale image decoder Barco-Silex AllianceCORE V-ll V-E Vv 
PEG, Motion Codec core V1.0 Amphion Semiconductor Ltd. AllianceCORE V-ll V-E Vv 
PEG, Motion Decoder core V1.0 Amphion Semiconductor Ltd. AllianceCORE V-ll V-E Vv 
PEG, Motion Encoder core V2.0 Amphion Semiconductor Ltd. AllianceCORE V-ll V-E Vv 
Longitudinal Time Code Generator Deltatec S.A. AllianceCORE V 
MPEG-2 HDTV | & P Encoder Duma Video, Inc. AllianceCORE V-ll 
MPEG-2 SDTV | & P Encoder Duma Video, Inc. AllianceCORE V-ll 
NTSC-COSEP Pinpoint Solutions, Inc. AllianceCORE V-ll V-E $-3 S-IIE 
Backplanes and Gigabit Serial 1/0 
Aurora 401: single lane, 32-bit interface Xilinx Reference Design | V-IIP 
WP160: Emulating External SERDES Devices with 
Embedded RocketlO Transceivers Xilinx White Paper V-lIP 
XAPP 649: SONET Rate Conversion in Virtex-II Pro Devices Xilinx Reference Design | V-IIP 
XAPP 651: SONET and OTN Scramblers/Descramblers Xilinx Reference Design | V-IIP 
XAPP 652: Word Alignment and SONET/SDH Framing Xilinx Reference Design | V-IIP 
XAPP660: Partial Reconfiguration of RocketlO Attributes 
using PPC405 core (DCR Bus) Xilinx Reference Design | V-IIP 
XAPP661: RocketlO Transceiver Bit-Error Rate Tester (BERT) Xilinx Reference Design | V-IIP 
XAPP662: Partial Reconfig. of RocketlO Attributes using PPC405 core 
(PLB or OPB bus) + RocketlO Transceiver Bit-Error Rate Tester (BERT) Xilinx Reference Design | V-IIP 
Basic Elements 
Binary Counter Xilinx LogiCORE V-IIP | V-ll_ | V-E Vv S230 ES NE} Sill 
Binary Decoder Xilinx LogiCORE V-IIP | V-ll_ | V-E Vv S3e ESE Sal 
Bit Bus Gate Xilinx LogiCORE V-IIP_ | V-Il_ | V-E V S-3 | S-IIE| S-ll 
Bit Gate Xilinx LogiCORE V-IIP_ | V-Il_ | V-E V S-3 | S-IIE| S-ll 
Bit Multiplexer Xilinx LogiCORE V-IIP_ | V-Il_| V-E V S-3 | S-IIE| S-ll 
BUFE-based Multiplexer Slice Xilinx LogiCORE V-IIP_ | V-Il_ | V-E V S-3 | S-IIE| S-ll 
BUFT-based Multiplexer Slice Xilinx LogiCORE V-IIP_ | V-Il_ | V-E V S-3 | S-IIE| S-ll 
Bus Gate Xilinx LogiCORE V-IIP_ | V-Il_| V-E V S-3 | S-IIE| S-ll 
Bus Multiplexer Xilinx LogiCORE V-IIP | V-ll | V-E Vv S337) / SIE) Sal 
Comparator Xilinx LogiCORE V-IIP | V-ll_ | V-E Vv S230 |S NEI eSell 
FD-based Parallel Register Xilinx LogiCORE V-IIP_ | V-Il_ | V-E V S-3 | S-IIE| S-ll 
FD-based Shift Register Xilinx LogiCORE V-IIP_ | V-Il_| V-E V S-3 | S-IIE| S-ll 
Four-Input MUX Xilinx LogiCORE Ss 
LD-based Parallel Latch Xilinx LogiCORE V-IIP_ | V-Il_ | V-E V S-3 | S-IIE| S-ll 
Parallel-to-Serial Converter Xilinx LogiCORE Ss 
RAM-based Shift Register Xilinx LogiCORE V-IIP_ | V-Il_| V-E V S-3 | S-IIE| S-ll 
Register Xilinx LogiCORE Ss 
Three-Input MUX Xilinx LogiCORE Ss 
Two-Input MUX Xilinx LogiCORE 5 
Prototype & Development Hardware Products 
CPU + FPGA (Virtex/Spartan-ll) MicroEngine Cards Intrinsyc, Inc. AllianceCORE 
CPU + FPGA (Virtex-II) MicroEngine Cards Intrinsyc, Inc. AllianceCORE V-ll 
JockoBoard SOC Virtex-II Prototyping Platform RealFast Operating Systems AB AllianceCORE V-ll 
LogiCRAFT Evaluation System Xylon d.o.0. AllianceCORE V-E S-ll 
PC/104-Plus Reconfigurable Module Board Derivation Systems, Inc. AllianceCORE 
uPCI Reference and Development Platform Intrinsyc, Inc. AllianceCORE 
XTENSA Microprocessor Emulation Kit Tensilica, Inc. AllianceCORE V-ll 
Visit the Xilinx IP Center for more details at www.xilinx.com/ipcenter 
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Avoid messy timing mistakes. = . 
Use Mentor Graphics” 2 
FPGA design tools. oS 


PRECISION’ SYNTHESIS | As any FPGA designer can tell you, achieving precise timing is their single most critical challenge. That's why 
Mentor Graphics” created Precision” Synthesis, a powerful new tool suite that lets designers close on timing faster than ever. Precision” 
Synthesis provides the most comprehensive analysis for complex FPGAs with the only complete, built-in incremental timing analysis. 
Quickly find and optimize the most critical paths and avoid frustrating trial-and-error design iterations. Visit www.mentor.com/fpga 
today or call 877.387.5873 for more information on how Precision"Synthesis can help you find the fastest path to a completed design. 


qMsnler 


©2003 Mentor Graphics Corporation. All Rights Reserved. Mentor Graphics is a registered trademark of Mentor Graphics Corporation. 





To maximize Sa vings on your 
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L today’s challenging business environment, cost reduc- 


tion is critical. Together, Xilinx and Synplicity deliver 
everything you need to optimize your FPGA designs — 
and your budget. 


By combining the performance of the best synthesis tool 
available, Synplicity’s Synplify Pro; with industry-leading 
Xilinx FPGAs, we give you the power to quickly meet critical 
design timing demands while minimizing area, resulting in 


a smaller device or lower speed-grade requirement. 


$= XILINX’ 


The Programmable Logic Company™ 


A slower speed grade alone could result in device savings 
of up to 25%. 


This unique partnership is the latest advancement in our efforts 
to minimize system design costs through technology innovations 
such as 90nm design rules, 300mm wafers for manufacturing, 
high-level function integration and ISE-tool productivity 
enhancements. With Xilinx and Synplicity, you get the best of 
both worlds — increased FPGA performance and decreased 


design costs. Visit www.xilinx.com/savings to learn more. 


<> 
Synplicity’ 


logo and Synplify Pro are registered trademarks of Synplicity Inc. All other trademarks and registered trademarks are the property of their respective owners. 
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